NOVEL MEMBERS OF THE CAPSAICIN/VANILLOID RECEPTOR FAMILY
OF PROTEINS AND USES THEREOF



Background of the Invention 

Pain is initiated when the peripheral terminals of a subgroup of sensory neurons
are activated by noxious chemical, mechanical or thermal stimuli. These neurons, called
nociceptors, transmit information regarding tissue damage to pain-processing centres in
the spinal cord and brain (Fields, H.L. Pain, McGraw-Hill, New York, 1987).
Nociceptors are characterized, in part, by their sensitivity to capsaicin, a
vanilloid-containing compound and a natural product of capsicum peppers that is the active
ingredient of many "hot" and spicy foods. In mammals, exposure of nociceptor
terminals to capsaicin leads initially to excitation of the neuron and the consequent
perception of pain and local release of inflammatory mediators. With prolonged
exposure, nociceptor terminals become insensitive to capsaicin, as well as to other
noxious stimuli (Szolcsanyi, J. in Capsaicin in the Study of Pain (ed. Wood, J.) 1-26
(Academic, London, 1993)). This latter phenomenon of nociceptor desensitization
underlies the seemingly paradoxical use of capsaicin as an analgesic agent in the
treatment of painful disorders ranging from viral and diabetic neuropathies to
rheumatoid arthritis (Campbell, E. in Capsaicin in the Study of Pain (ed. Wood, J.)
255-272 (Academic, London, 1993); Szallasi, A. et al (1996) Pain 68, 195-208). Some
of this decreased sensitivity to noxious stimuli may result from reversible changes in the
nociceptor, but the long-term loss of responsiveness can be explained by death of the
nociceptor or destruction of its peripheral terminals following exposure to capsaicin
(Jancso, G. et al (1977) Nature 270, 741-743).

The cellular specificity of capsaicin action and its ability to evoke the sensation
of burning pain have led to speculation that the target of capsaicin action plays an
important physiological role in the detection of painful stimuli. Indeed, capsaicin may
elicit the perception of pain by mimicking the actions of a physiological stimulus or an
endogenous ligand produced during tissue injury (James, I.F., Ninkina, N.N. & Wood,
J.N. in Capsaicin in the Study of Pain (ed. Wood, J.N.) 83-104 (Academic, London,
1993)).

Caterina M.J. et al have recently determined the molecular basis underlying this
phenomenon by characterizing a functional cDNA that encodes a vanilloid receptor
(VR-1) in rat sensory ganglia (Caterina M.J. et al (1997) Nature 389:816-824). VR-1
is a vanilloid-gated, nonselective cation channel that resembles members of the transient
receptor potential (TRP) channel family, first identified as components of the
Drosophila phototransduction pathway (Montell et al (1989) Neuron 2:1313-1323).

Summary of the Invention

The present invention is based, at least in part, on the discovery of novel
members of the Capsaicin/Vanilloid family of receptors. Described herein is the
isolation of the human orthologue of rat VR-1 (rVR-1), referred to herein as hVR-1, as
well as another previously unknown member of the VR family of receptors, referred to
herein as VR-2, and specifically as human VR-2 (hVR-2, including an alternate form
which contains a deletion) and rat VR-2 (rVR-2) nucleic acid and protein molecules.
The hVR-1, hVR-2, and rVR-2 molecules of the present invention are useful as targets
for developing modulating agents to regulate a variety of cellular processes, e.g., cellular
processes involved in pain. Accordingly, in one aspect, this invention provides isolated
nucleic acid molecules encoding hVR-1, hVR-2, and rVR-2 proteins and fragments
thereof, as well as nucleic acid fragments suitable as primers or hybridization probes for
the detection of hVR-1, hVR-2, and rVR-2-encoding nucleic acids.

In one embodiment, an hVR-1, hVR-2, or rVR-2 nucleic acid molecule of the
invention is at least 60%, 65%, 70%, 75%, 80%, 83%, 85%, 86%, 87%, 88%, 89%,
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to the
nucleotide sequence (e.g., to the entire length of the nucleotide sequence) shown in SEQ
ID NO:1, 3, 4, 6, 7, 9, 10, or 12, or a complement thereof.
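
By way of non-limiting illustration, a percent identity value of the kind recited above can be sketched for two sequences that have already been aligned. The sketch assumes gaps are written as "-" and that identity is counted over the entire length of the reference sequence; the function name `percent_identity` and this counting convention are assumptions made for illustration and are not taken from the specification.

```python
def percent_identity(aligned_ref: str, aligned_query: str) -> float:
    """Percent of reference positions matched by the query in a pairwise alignment.

    Both strings must come from the same alignment and have equal length.
    Columns where the reference has a gap are ignored; columns where only the
    query has a gap count as mismatches against the reference.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")

    ref_positions = 0
    matches = 0
    for ref_char, query_char in zip(aligned_ref.upper(), aligned_query.upper()):
        if ref_char == "-":
            continue  # insertion relative to the reference: not a reference position
        ref_positions += 1
        if ref_char == query_char:
            matches += 1

    if ref_positions == 0:
        raise ValueError("reference contains no residues")
    return 100.0 * matches / ref_positions


def meets_identity_threshold(aligned_ref: str, aligned_query: str, threshold: float) -> bool:
    """True if the query is at least `threshold` percent identical to the reference."""
    return percent_identity(aligned_ref, aligned_query) >= threshold
```

Under these assumptions, a query matching 4 of 5 reference positions would satisfy an "at least 80%" threshold but not an "at least 85%" threshold. Other conventions (for example, dividing by the alignment length) would give different values.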

In another embodiment, the isolated nucleic acid molecule includes the
nucleotide sequence shown in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, or a complement
thereof. In another embodiment, the nucleic acid molecule includes at least 10, 15, 20,
or more contiguous nucleotides of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12.

In another embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule
includes a nucleotide sequence encoding a protein having an amino acid sequence
sufficiently homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or 11. In
one embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule includes a
nucleotide sequence encoding a protein having an amino acid sequence at least 60%,
65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more identical to the entire length
of the amino acid sequence of SEQ ID NO:2, 5, 8, or 11.

Another embodiment of the invention features nucleic acid molecules, preferably
hVR-1, hVR-2, and rVR-2 nucleic acid molecules, which specifically detect hVR-1,
hVR-2, and rVR-2 nucleic acid molecules relative to nucleic acid molecules encoding
non-hVR-1, non-hVR-2, and non-rVR-2 proteins. For example, in one embodiment,
such a nucleic acid molecule is at least 100-150, 150-200, 200-250, 250-300, 300-350,
350-400, 400-450, 450-500, 500-550, 550-600, 600-700, 700-800, 800-900, 900-1000,
1088, or more nucleotides in length and hybridizes under stringent conditions to a
nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID NO:1, 3, 4,
6, 7, 9, 10, or 12. In preferred embodiments, the nucleic acid molecules are at least 15
(e.g., contiguous) nucleotides in length and hybridize under stringent conditions to
nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:1. In other preferred
embodiments, the nucleic acid molecules comprise nucleotides 1-17, 3696-3863, or
3901-3909 of SEQ ID NO:1. In yet other preferred embodiments, the nucleic acid
molecules consist of nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:1. In
preferred embodiments, the nucleic acid molecules are at least 15 (e.g., contiguous)
nucleotides in length and hybridize under stringent conditions to nucleotides 1944-2003
of SEQ ID NO:4. In other preferred embodiments, the nucleic acid molecules comprise
nucleotides 1944-2003 of SEQ ID NO:4. In yet other preferred embodiments, the
nucleic acid molecules consist of nucleotides 1944-2003 of SEQ ID NO:4.
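
For illustration only, the nucleotide ranges recited above can be read as 1-based, inclusive coordinates; that reading is an assumption of this sketch. The helper names `region_length` and `extract_region` are hypothetical and do not appear in the specification.

```python
def region_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive range such as nucleotides 1944-2003."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a sequence."""
    region_length(start, end)  # validates the coordinates
    if end > len(sequence):
        raise ValueError("range extends past the end of the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]
```

Under this reading, nucleotides 1944-2003 span 60 nucleotides and nucleotides 3901-3909 span 9 nucleotides, so a probe of at least 15 contiguous nucleotides fits entirely within the former region but could only overlap the latter.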

In other embodiments, the nucleic acid molecule encodes a naturally occurring
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5,
8, or 11, wherein the nucleic acid molecule hybridizes to a nucleic acid molecule
consisting of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions and is
encoded by the same locus as hVR-1, hVR-2 or rVR-2.



wo 00/29577 PCT/US99/26701 

Another embodiment of the invention provides a nucleic acid molecule that
encodes a naturally occurring orthologue of a polypeptide comprising the amino acid
sequence of SEQ ID NO:2, 5, 8, or 11, wherein the nucleic acid molecule hybridizes to a
nucleic acid molecule consisting of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 under stringent
conditions.

Another embodiment of the invention provides an isolated nucleic acid molecule 
which is antisense to an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, e.g., the 
coding strand of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. 
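
As a minimal sketch of what "antisense to the coding strand" means at the sequence level, the reverse complement of a coding-strand sequence can be computed as shown below. The function name `antisense_dna` is hypothetical, the output is written as DNA, and only the unambiguous bases A, C, G, T (or U in an RNA input) are handled; these are assumptions of the sketch rather than features of the specification.

```python
# Map each base to its Watson-Crick partner; U in an RNA input pairs with A.
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def antisense_dna(coding_strand: str) -> str:
    """Return the reverse complement (as DNA) of a coding-strand sequence."""
    bases = coding_strand.upper()
    try:
        complemented = [_COMPLEMENT[base] for base in bases]
    except KeyError as err:
        raise ValueError(f"unsupported base: {err.args[0]!r}") from None
    # Reverse so the result reads 5' to 3', like the input.
    return "".join(reversed(complemented))
```

For example, the coding-strand fragment ATGGCC has the antisense sequence GGCCAT when both are written 5' to 3'.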

Since the hVR-2 (the alternate form) and rVR-2 sequences represent fragments of
the entire coding regions of these genes, another embodiment of the invention provides
the complete gene sequences. A skilled artisan can readily isolate such molecules using
the sequences disclosed herein.

Another aspect of the invention provides a vector comprising an hVR-1, an hVR-2,
or a rVR-2 nucleic acid molecule. In certain embodiments, the vector is a
recombinant expression vector. In another embodiment, the invention provides a host
cell containing a vector of the invention. In yet another embodiment, the invention
provides a host cell containing a nucleic acid molecule of the invention. The invention
also provides a method for producing a protein, preferably an hVR-1, hVR-2, and rVR-2
protein, by culturing in a suitable medium, a host cell, e.g., a mammalian host cell such
as a non-human mammalian cell, of the invention containing a recombinant expression
vector, such that the protein is produced.

Another aspect of this invention features isolated or recombinant hVR-1, hVR-2,
and rVR-2 proteins and polypeptides. In one embodiment, the isolated protein,
preferably an hVR-1, hVR-2, or rVR-2 protein, includes at least one transmembrane
domain. In another embodiment, the isolated protein, preferably an hVR-1, hVR-2, or
rVR-2 protein, includes at least one transmembrane domain and at least one proline rich
domain. In yet another embodiment, the isolated protein, preferably an hVR-1, hVR-2,
or rVR-2 protein, includes at least one transmembrane domain, at least one proline rich
domain, and at least one ankyrin repeat domain. In yet another embodiment, the protein,
preferably an hVR-1, hVR-2, or rVR-2 protein, includes at least one transmembrane
domain, at least one proline rich domain, and at least one ankyrin repeat domain and has
an amino acid sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%,
95%, 98% or more homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or
11. In another embodiment, the protein, preferably an hVR-1, hVR-2, or rVR-2 protein,
includes at least one transmembrane domain, at least one proline rich domain, and at
least one ankyrin repeat domain and plays a role in the development and regulation of
pain. In yet another embodiment, the protein, preferably an hVR-1, hVR-2, and rVR-2
protein, includes at least one transmembrane domain, at least one proline rich domain,
and at least one ankyrin repeat domain and is encoded by a nucleic acid molecule having
a nucleotide sequence which hybridizes under stringent hybridization conditions to a
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:1, 3, 4, 6, 7, 9,
10, or 12.

In another embodiment, the invention features fragments of the protein having
the amino acid sequence of SEQ ID NO:2, 5, 8, or 11, wherein the fragment comprises
at least 15, 30, 40, 50, 60, 70, 80, 90, or 100 amino acids (e.g., contiguous amino acids).

In another embodiment, the invention features an isolated protein, preferably an
hVR-1, hVR-2, and rVR-2 protein, which is encoded by a nucleic acid molecule
consisting of a nucleotide sequence at least about 60%, 65%, 70%, 75%, 80%, 83%,
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or
more homologous to a nucleotide sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, or a
complement thereof. This invention further features an isolated protein, preferably an
hVR-1, hVR-2, or rVR-2 protein, which is encoded by a nucleic acid molecule
consisting of a nucleotide sequence which hybridizes under stringent hybridization
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID
NO:1, 3, 4, 6, 7, 9, 10, or 12, or a complement thereof.

The proteins of the present invention or portions thereof, e.g., biologically active
portions thereof, can be operatively linked to a non-hVR-1, non-hVR-2, or non-rVR-2
polypeptide (e.g., heterologous amino acid sequences) to form fusion proteins. The
invention further features antibodies, such as monoclonal or polyclonal antibodies, that
specifically bind proteins of the invention, preferably hVR-1, hVR-2, and rVR-2
proteins. In addition, the hVR-1, hVR-2, and rVR-2 proteins or biologically active
portions thereof can be incorporated into pharmaceutical compositions, which optionally
include pharmaceutically acceptable carriers.

In another aspect, the present invention provides a method for detecting the
presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide
in a biological sample by contacting the biological sample with an agent capable of
detecting an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide
such that the presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or
polypeptide is detected in the biological sample.

In another aspect, the present invention provides a method for detecting the
presence of hVR-1, hVR-2, and rVR-2 activity in a biological sample by contacting the
biological sample with an agent capable of detecting an indicator of hVR-1, hVR-2, and
rVR-2 activity such that the presence of hVR-1, hVR-2, and rVR-2 activity is detected
in the biological sample.

In another aspect, the invention provides a method for modulating hVR-1, hVR-2,
and rVR-2 activity comprising contacting a cell capable of expressing hVR-1, hVR-2,
and rVR-2 with an agent that modulates hVR-1, hVR-2, and rVR-2 activity such that
hVR-1, hVR-2, and rVR-2 activity in the cell is modulated. In one embodiment, the
agent inhibits hVR-1, hVR-2, and rVR-2 activity. In another embodiment, the agent
stimulates hVR-1, hVR-2, and rVR-2 activity. In one embodiment, the agent is an
antibody that specifically binds to an hVR-1, hVR-2, and rVR-2 protein. In another
embodiment, the agent modulates expression of hVR-1, hVR-2, and rVR-2 by
modulating transcription of an hVR-1, hVR-2, and rVR-2 gene or translation of an hVR-1,
hVR-2, and rVR-2 mRNA. In yet another embodiment, the agent is a nucleic acid
molecule having a nucleotide sequence that is antisense to the coding strand of an hVR-1,
hVR-2, and rVR-2 mRNA or an hVR-1, hVR-2, and rVR-2 gene.

In one embodiment, the methods of the present invention are used to treat a
subject having a disorder characterized by aberrant hVR-1, hVR-2, and rVR-2 protein or
nucleic acid expression or activity by administering an agent which is an hVR-1, hVR-2,
and rVR-2 modulator to the subject. In one embodiment, the hVR-1, hVR-2, and rVR-2
modulator is an hVR-1, hVR-2, and rVR-2 protein. In another embodiment, the hVR-1,
hVR-2, and rVR-2 modulator is an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. In
yet another embodiment, the hVR-1, hVR-2, and rVR-2 modulator is a peptide,
peptidomimetic, or other small molecule. In a further embodiment, the disorder
characterized by aberrant hVR-1, hVR-2, and rVR-2 protein or nucleic acid expression
is a pain disorder, e.g., hyperalgesia.

The present invention also provides a diagnostic assay for identifying the
presence or absence of a genetic alteration characterized by at least one of (i) aberrant
modification or mutation of a gene encoding an hVR-1, hVR-2, and rVR-2 protein; (ii)
mis-regulation of the gene; and (iii) aberrant post-translational modification of an hVR-1,
hVR-2, and rVR-2 protein, wherein a wild-type form of the gene encodes a protein
with an hVR-1, hVR-2, and rVR-2 activity (as described herein).

In another aspect, the invention provides a method for identifying a compound
that binds to or modulates the activity of an hVR-1, hVR-2, and rVR-2 protein, by
providing an indicator composition comprising an hVR-1, hVR-2, and rVR-2 protein
having hVR-1, hVR-2, and rVR-2 activity, contacting the indicator composition with a
test compound, and determining the effect of the test compound on hVR-1, hVR-2, and
rVR-2 activity in the indicator composition to identify a compound that modulates the
activity of an hVR-1, hVR-2, and rVR-2 protein.

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

Brief Description of the Drawings 

Figure 1 depicts the full length cDNA sequence and predicted amino acid
sequence of human VR-1 (hVR-1). The nucleotide sequence corresponds to nucleic
acids 1 to 3909 of SEQ ID NO:1. The amino acid sequence corresponds to amino acids
1 to 839 of SEQ ID NO:2. The coding region without the 5' and 3' untranslated regions
of the human VR-1 (hVR-1) gene is shown in SEQ ID NO:3.

Figure 2 depicts the full length cDNA sequence and predicted amino acid
sequence of human VR-2 (hVR-2). The nucleotide sequence corresponds to nucleic
acids 1 to 2809 of SEQ ID NO:4. The amino acid sequence corresponds to amino acids
1 to 764 of SEQ ID NO:5. The coding region without the 5' and 3' untranslated regions
of the human VR-2 (hVR-2) gene is shown in SEQ ID NO:6.

Figure 3 depicts the partial cDNA sequence and partial predicted amino acid
sequence of an alternate form of human VR-2 (hVR-2). The nucleotide sequence
corresponds to nucleic acids 1 to 1489 of SEQ ID NO:7. The amino acid sequence
corresponds to amino acids 1 to 436 of SEQ ID NO:8. The coding region without the 5'
and 3' untranslated regions of the alternate form of human VR-2 (hVR-2) gene is shown
in SEQ ID NO:9.

Figure 4 depicts the partial cDNA sequence and partial predicted amino acid
sequence of rat VR-2 (rVR-2). The nucleotide sequence corresponds to nucleic acids 1
to 1794 of SEQ ID NO:10. The amino acid sequence corresponds to amino acids 1 to
554 of SEQ ID NO:11. The coding region without the 5' and 3' untranslated regions of
the rat VR-2 (rVR-2) gene is shown in SEQ ID NO:12.

Figure 5 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the
human VR-2 protein (SEQ ID NO:5) using the GAP program in the GCG software
package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4.

Figure 6 depicts an alignment of the hVR-1 nucleotide sequence (SEQ ID NO:1)
with the human VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in
the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length
weight of 3.

Figure 7 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat
VR-2 protein (SEQ ID NO:11) using the CLUSTAL W (1.74) multiple sequence
alignment program.

Figure 8 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat
VR-2 protein (SEQ ID NO:11) using the GAP program in the GCG software package
(Blosum 62 matrix) and a gap weight of 12 and a length weight of 4.

Figure 9 depicts an alignment of the hVR-1 nucleotide sequence (SEQ ID NO:1)
with the rat VR-1 nucleotide sequence (Accession Number: AF029310) using the GAP
program in the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a
length weight of 3.

Figure 10 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the rat
VR-1 protein (Accession Number: AF029310) using the GAP program in the GCG
software package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4.

Figure 11 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the
human VR-2 protein (alternate form) (SEQ ID NO:8) using the CLUSTAL W (1.74)
multiple sequence alignment program.

Figure 12 depicts a structural, hydrophobicity, and antigenicity analysis of the
hVR-1 protein.

Figure 13 depicts the results of a search using the amino acid sequence of the
hVR-1 protein against the HMM database.

Figure 14 depicts a structural, hydrophobicity, and antigenicity analysis of the
hVR-2 protein.

Figure 15 depicts the results of a search using the amino acid sequence of the
hVR-2 protein against the HMM database.

Figure 16 depicts the predicted full length amino acid sequence of the human
VR-2 protein (alternate form) (SEQ ID NO:20).

Figure 17 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the
predicted full length human VR-2 protein (alternate form) (SEQ ID NO:20) using the
CLUSTAL W (1.74) multiple sequence alignment program.

Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of nucleic acid
and amino acid molecules which are novel members of the Capsaicin/Vanilloid family
of receptors. Described herein is the isolation of the human orthologue of rat VR-1
(rVR-1), referred to herein as hVR-1, as well as another previously unknown member of
the VR family of receptors, referred to herein as VR-2, and specifically as human VR-2
(hVR-2) and rat VR-2 (rVR-2) nucleic acid and protein molecules. The hVR-1, hVR-2,
and rVR-2 molecules were identified based on their sequence similarity to the known rat
vanilloid receptor (VR-1). VR-1 is a vanilloid-gated, non-selective cation channel
which resembles members of the transient receptor potential (TRP) ion channel family
(described in Montell et al. (1989) Neuron 2:1313-1323) that mediate the influx of
extracellular calcium in response to depletion of intracellular calcium stores. The rat
VR-1 cDNA contains an open reading frame of 2514 nucleotides that encodes a protein
of 838 amino acids. Hydrophilicity analysis has indicated that rat VR-1 contains six
transmembrane domains (predicted to be mostly α-helices) with an additional short
hydrophobic stretch between transmembrane regions 5 and 6. The amino terminal
hydrophilic segment contains a relatively proline rich region followed by three ankyrin
repeat domains. The rat VR-1 is expressed in small diameter neurons within sensory
ganglia. The present hVR-1 sequence is the human orthologue of rVR-1. As described
in further detail infra, the human VR-1 is expressed in nodose, trigeminal sensory
neurons, as well as in some, but not all, small dorsal root ganglion (DRG) neurons and
in a few medium sized DRG neurons.

The hVR-1, hVR-2, and rVR-2 molecules of the present invention play a role in
pain signaling mechanisms. As used herein, the term "pain signaling mechanisms"
includes the cellular mechanisms involved in the development and regulation of pain,
e.g., pain elicited by noxious chemical, mechanical, or thermal stimuli, in a subject, e.g.,
a mammal such as a human. In mammals, the initial detection of noxious chemical,
mechanical, or thermal stimuli, a process referred to as "nociception", occurs
predominantly at the peripheral terminals of specialized, small diameter primary afferent
neurons, called polymodal nociceptors. These afferent neurons transmit the information
to the central nervous system, evoking a perception of pain or discomfort and initiating
appropriate protective reflexes. Capsaicin/Vanilloid receptors, e.g., the hVR-1, hVR-2,
and rVR-2 molecules of the present invention, present on these afferent neurons, are
involved in detecting these noxious chemical, mechanical, or thermal stimuli and
transducing this information into membrane depolarization events. Thus, the hVR-1,
hVR-2, and rVR-2 molecules, by participating in pain signaling mechanisms, can
modulate pain elicitation and provide novel diagnostic targets and therapeutic agents to
control pain.

The hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets and
therapeutic agents to control pain in a variety of disorders, diseases, or conditions which
are characterized by a deregulated, e.g., upregulated or downregulated, pain response.
For example, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets
and therapeutic agents to control the exaggerated pain response elicited during various
forms of tissue injury, e.g., inflammation, infection, and ischemia, usually referred to as
hyperalgesia (described in, for example, Fields, H.L. (1987) Pain, New York:McGraw-Hill).
Moreover, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic
targets and therapeutic agents to control pain associated with musculoskeletal disorders,
e.g., joint pain; tooth pain; headaches; pain associated with surgery, or neuropathic pain.

As the hVR-1 gene maps to a region of human chromosome 17 between WI-5436
(7.7 cR) and WI-6584 (18.9 cR) (Example 6), which has been associated with
myasthenia gravis, Smith-Magenis syndrome, CORD5, Cone-rod dystrophy, and breast
cancer, the hVR-1 molecule may provide novel diagnostic targets and therapeutic agents
to treat, diagnose, or prognose these disorders or other disorders linked to this
chromosomal region. Similarly, as the hVR-2 gene maps to a region of human
chromosome 17 between AFMA043ZB5 (23.3 cR) and D17S721 (29.3 cR) (Example 6),
which has been associated with myasthenia gravis, Smith-Magenis syndrome, CORD5,
Cone-rod dystrophy, choroidal dystrophy, central areolar, and retinal cone dystrophy,
the hVR-2 molecule may provide novel diagnostic targets and therapeutic agents to treat,
diagnose, or prognose these disorders or other disorders linked to this chromosomal
region.

The term "family" when referring to the protein and nucleic acid molecules of 
the invention is intended to mean two or more proteins or nucleic acid molecules having 
a common structural domain or motif and having sufficient amino acid or nucleotide 
sequence homology as defined herein. Such family members can be naturally or non- 
naturally occurring and can be from either the same or different species. For example, a 
family can contain a first protein of human origin, as well as other, distinct proteins of 
human origin or alternatively, can contain homologues of non-human origin. Members 
of a family may also have common functional characteristics. 

For example, the family of hVR-1, hVR-2, and rVR-2 proteins comprises at least
one, and preferably six, "transmembrane domains." As used herein, the term
"transmembrane domain" includes an amino acid sequence of about 15 amino acid
residues in length which spans the plasma membrane. More preferably, a
transmembrane domain includes at least about 20, 25, 30, 35, 40, or 45 amino acid
residues and spans the plasma membrane. Transmembrane domains are rich in
hydrophobic residues, and typically have a helical structure. In one embodiment, at least
50%, 60%, 70%, 80%, 90%, 95% or more of the amino acid residues of a
transmembrane domain are hydrophobic, e.g., leucines, isoleucines, tyrosines, or
tryptophans. Transmembrane domains are described in, for example, Zagotta W.N. et
al. (1996) Annual Rev. Neurosci. 19:235-63, the contents of which are incorporated
herein by reference. Amino acid residues 434-455, 480-495, (509-531; based on
homology to the rat VR-1) or 514-531, (543-569; based on homology to the rat VR-1) or
538-555, (577-596; based on homology to the rat VR-1) or 580-599, and (656-683;
based on homology to the rat VR-1) or 658-682 of hVR-1 (SEQ ID NO:2) and amino
acid residues 391-410, 431-448, 459-476, 486-508, 538-556, and 621-645 of hVR-2
(SEQ ID NO:5) comprise transmembrane domains.
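
By way of illustration only, the hydrophobic-residue percentage described above can be estimated for a candidate transmembrane segment as sketched below. The text names leucine, isoleucine, tyrosine, and tryptophan as examples; the additional residues in the `HYDROPHOBIC` set (A, V, M, F), the function names, and the default 50% cutoff are assumptions of this sketch, and other hydrophobicity scales would give different results.

```python
# Residues named in the text (L, I, Y, W) plus A, V, M and F, which are
# included here as an assumption for illustration.
HYDROPHOBIC = frozenset("LIYWAVMF")


def hydrophobic_fraction(segment: str) -> float:
    """Fraction (0.0-1.0) of residues in a one-letter-code segment that are hydrophobic."""
    if not segment:
        raise ValueError("segment must not be empty")
    residues = segment.upper()
    hydrophobic_count = sum(1 for residue in residues if residue in HYDROPHOBIC)
    return hydrophobic_count / len(residues)


def is_hydrophobic_rich(segment: str, minimum_fraction: float = 0.5) -> bool:
    """True if at least `minimum_fraction` of the segment's residues are hydrophobic."""
    return hydrophobic_fraction(segment) >= minimum_fraction
```

A segment such as one of the residue ranges listed above would first be cut out of the full-length sequence and then passed to `is_hydrophobic_rich` with the desired cutoff (0.5, 0.6, and so on).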

In another embodiment, an hVR-1, hVR-2, and rVR-2 of the present invention is
identified based on the presence of a "proline rich domain" in the protein or
corresponding nucleic acid molecule. As used herein, the term "proline rich domain"
includes an amino acid sequence of about 4-6 amino acid residues in length having the
general sequence X-Pro-X-X-Pro-X (where X can be any amino acid). Proline rich
domains are usually located in a helical structure and bind through hydrophobic
interactions to SH3 domains. SH3 domains recognize proline rich domains in both
forward and reverse orientations. Proline rich domains are described in, for example,
Sattler M. et al. (1998) Leukemia 12:637-644, the contents of which are incorporated
herein by reference.
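
As a non-limiting sketch, the general sequence X-Pro-X-X-Pro-X can be located in a one-letter-code amino acid sequence with a regular expression. The function name `find_proline_rich_motifs` is hypothetical, and the sketch reports overlapping occurrences with 1-based start positions; both choices are assumptions made for illustration.

```python
import re

# X-Pro-X-X-Pro-X in one-letter code: any residue, P, any two residues, P, any residue.
# The lookahead lets overlapping occurrences be reported.
_PROLINE_RICH = re.compile(r"(?=(.P..P.))")


def find_proline_rich_motifs(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched six-residue window) for each occurrence."""
    sequence = protein.upper()
    return [(match.start() + 1, match.group(1)) for match in _PROLINE_RICH.finditer(sequence)]
```

For example, in the hypothetical peptide MAPPLPEK the only window of the form X-Pro-X-X-Pro-X is APPLPE, which starts at residue 2.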

In another embodiment, an hVR-1, hVR-2, and rVR-2 of the present invention is
identified based on the presence of an "ankyrin repeat domain" in the protein or
corresponding nucleic acid molecule. As used herein, the term "ankyrin repeat domain"
includes a protein domain having an amino acid sequence of about 30-50 amino acid
residues and having a bit score for the alignment of the sequence to the ankyrin repeat
domain (HMM) of at least 6. Preferably, an ankyrin repeat domain includes at least
about 30-45, more preferably about 30-40 amino acid residues, or about 30-35 amino
acids and has a bit score for the alignment of the sequence to the ankyrin repeat domain
(HMM) of at least 3-10, more preferably 10-30, more preferably 30-50, even more
preferably 50-75, 75-100, 100-200 or greater. The ankyrin repeat domain HMM has
been assigned the PFAM Accession PF00023 (http://genome.wustl.edu/Pfam/.html).
Ankyrin repeats are involved in protein-protein interactions and are described in, for
example, Ketchum K.A. et al. (1996) FEBS Letters 378:19-26, the contents of which are
incorporated herein by reference.

To identify the presence of an ankyrin repeat domain in an hVR-1, hVR-2, and
rVR-2 protein and make the determination that a protein of interest has a particular
profile, the amino acid sequence of the protein is searched against a database of HMMs
(e.g., the Pfam database, release 2.1) using the default parameters
(http://www.sanger.ac.uk/Software/Pfam/HMM_search). A description of the Pfam
database can be found in Sonnhammer et al. (1997) Proteins 28(3):405-420 and a detailed
description of HMMs can be found, for example, in Gribskov et al. (1990) Meth.
Enzymol. 183:146-159; Gribskov et al. (1987) Proc. Natl. Acad. Sci. USA 84:4355-4358;
Krogh et al. (1994) J. Mol. Biol. 235:1501-1531; and Stultz et al. (1993) Protein
Sci. 2:305-314, the contents of which are incorporated herein by reference. A search
was performed against the HMM database resulting in the identification of three ankyrin
repeat domains in the amino acid sequence of SEQ ID NO:2 (at about residues 201-233,
248-283, and 333-361) and SEQ ID NO:5 (at about residues 162-194, 208-243, and 293-328).
The results of the searches are set forth in Figures 13 and 15.

Isolated proteins of the present invention, preferably hVR-l, hVR-2, and rVR-2 
proteins, have an amino acid sequence sufficiently identical to the amino acid sequence of 
SEQ ID N0:2, 5, 8, or 1 1 or are encoded by a nucleotide sequence sufficiently identical to 
SEQ ID N0:1, 3, 4, 6, 7, 9, 10, or 12. As used herein, the term ''sufficiently identical" 
refers to a first amino acid or nucleotide sequence which contains a sufficient or minimum 
number of identical or equivalent (e.g., an amino acid residue which has a similar side 
chain) amino acid residues or nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide sequences share common structural 
domains or motifs and/or a common functional activity. For example, amino acid or
nucleotide sequences which share common structural domains, have at least 30%, 40%, or
50% identity, preferably 60% identity, more preferably 70%-80%, and even more
preferably 90-95% identity across the amino acid sequences of the domains, and contain at
least one and preferably two structural domains or motifs are defined herein as sufficiently
identical. Furthermore, amino acid or nucleotide sequences which share at least 30%, 40%,
or 50%, preferably 60%, more preferably 70-80%, or 90-95% identity and share a common
functional activity are defined herein as sufficiently identical.
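
By way of illustration only, the percent identity thresholds recited above could be checked for a pair of pre-aligned sequences along the following lines. This is a minimal sketch, not a method taken from this specification: the function names, the use of "-" as the gap character, and the choice to count every aligned column (other than columns that are gaps in both sequences) in the denominator are all assumptions made for the example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Assumption: the inputs are already aligned (equal length, '-' marks a gap).
    Columns that are gaps in both sequences are skipped; a gap opposite a
    residue counts as a compared, non-identical column.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 60.0) -> bool:
    """True when the pair reaches the chosen threshold (e.g., 30, 60 or 90)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Identity alone does not establish that two sequences are "sufficiently identical" as defined above; the shared structural domains or the common functional activity would still have to be assessed separately.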

As used interchangeably herein, an "hVR-1, hVR-2, and rVR-2 activity", "biological
activity of hVR-1, hVR-2, and rVR-2" or "functional activity of hVR-1, hVR-2, and rVR-2"
refers to an activity exerted by an hVR-1, hVR-2, and rVR-2 protein, polypeptide or nucleic
acid molecule on an hVR-1, hVR-2, and rVR-2 responsive cell or on an hVR-1, hVR-2, and
rVR-2 protein substrate, as determined in vivo or in vitro, according to standard techniques.
In one embodiment, an hVR-1, hVR-2, and rVR-2 activity is a direct activity, such as an
association with an hVR-1, hVR-2, and rVR-2 target molecule. As used herein, a "target
molecule" or "binding partner" is a molecule with which an hVR-1, hVR-2, and rVR-2
protein binds or interacts in nature, such that hVR-1, hVR-2, and rVR-2-mediated function is
achieved. An hVR-1, hVR-2, and rVR-2 target molecule can be a non-hVR-1, non-hVR-2,
and non-rVR-2 molecule or an hVR-1, hVR-2, and rVR-2 protein or polypeptide of the
present invention. In an exemplary embodiment, an hVR-1, hVR-2, and rVR-2 target
molecule is an hVR-1, hVR-2, and rVR-2 ligand, e.g., capsaicin. Alternatively, an hVR-1,
hVR-2, and rVR-2 activity is an indirect activity, such as a cellular signaling activity
mediated by interaction of the hVR-1, hVR-2, and rVR-2 protein with an hVR-1, hVR-2, and
rVR-2 ligand.

Accordingly, another embodiment of the invention features isolated hVR-1,
hVR-2, and rVR-2 proteins and polypeptides having an hVR-1, hVR-2, and rVR-2
activity. Other proteins of the invention are hVR-1, hVR-2, and rVR-2 proteins having
at least one, and preferably six, transmembrane domains and, preferably, an hVR-1,
hVR-2, and rVR-2 activity. Yet other proteins of the invention are hVR-1, hVR-2, and
rVR-2 proteins having at least one transmembrane domain, at least one proline rich
domain and, preferably, an hVR-1, hVR-2, and rVR-2 activity. Other proteins of the
invention are hVR-1, hVR-2, and rVR-2 proteins having at least one transmembrane
domain, at least one proline rich domain, at least one ankyrin repeat domain and,
preferably, an hVR-1, hVR-2, and rVR-2 activity. Additional proteins of the invention
have at least one transmembrane domain, at least one proline rich domain, at least one
ankyrin repeat domain, and are, preferably, encoded by a nucleic acid molecule having a
nucleotide sequence which hybridizes under stringent hybridization conditions to a
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:1, 3, 4, 6, 7, 9,
10, or 12.

The nucleotide sequence of the full length hVR-1 cDNA and the predicted
amino acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID
NOs:1 and 2, respectively.

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4
and 5, respectively.

The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the
predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in
Figure 3 and in SEQ ID NOs:7 and 8, respectively.

The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino
acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs:10
and 11, respectively.

Various aspects of the invention are described in further detail in the following
subsections:



I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that
encode hVR-1, hVR-2, and rVR-2 proteins or biologically active portions thereof, as
well as nucleic acid fragments sufficient for use as hybridization probes to identify
hVR-1, hVR-2, and rVR-2-encoding nucleic acid molecules (e.g., hVR-1, hVR-2, and rVR-2
mRNA) and fragments for use as PCR primers for the amplification or mutation of
hVR-1, hVR-2, and rVR-2 nucleic acid molecules. As used herein, the term "nucleic acid
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using
nucleotide analogs. The nucleic acid molecule can be single-stranded or
double-stranded, but preferably is double-stranded DNA.

The term "isolated nucleic acid molecule" includes nucleic acid molecules which
are separated from other nucleic acid molecules which are present in the natural source
of the nucleic acid. For example, with regard to genomic DNA, the term "isolated"
includes nucleic acid molecules which are separated from the chromosome with which
the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid is free
of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and
3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic
acid is derived. For example, in various embodiments, the isolated hVR-1, hVR-2, and
rVR-2 nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5
kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in
genomic DNA of the cell from which the nucleic acid is derived. Moreover, an
"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of
other cellular material, or culture medium when produced by recombinant techniques, or
substantially free of chemical precursors or other chemicals when chemically
synthesized.

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule
having the nucleotide sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, can be isolated
using standard molecular biology techniques. Using all or a
portion of the nucleic acid sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 as a
hybridization probe, hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be isolated
using standard hybridization and cloning techniques (e.g., as described in Sambrook, J.,
Fritsch, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd ed., Cold
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY, 1989).

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID
NO:1, 3, 4, 6, 7, 9, 10, or 12 can be isolated by the polymerase chain reaction (PCR)
using synthetic oligonucleotide primers designed based upon the sequence of SEQ ID
NO:1, 3, 4, 6, 7, 9, 10, or 12.

A nucleic acid of the invention can be amplified using cDNA, mRNA or,
alternatively, genomic DNA as a template and appropriate oligonucleotide primers
according to standard PCR amplification techniques. The nucleic acid so amplified can
be cloned into an appropriate vector and characterized by DNA sequence analysis.
Furthermore, oligonucleotides corresponding to hVR-1, hVR-2, and rVR-2 nucleotide
sequences can be prepared by standard synthetic techniques, e.g., using an automated
DNA synthesizer.

In one embodiment, an isolated nucleic acid molecule of the invention comprises
the nucleotide sequence shown in SEQ ID NO:1. The sequence of SEQ ID NO:1
corresponds to the full length hVR-1 encoding cDNA.

In another embodiment, an isolated nucleic acid molecule of the invention
comprises the nucleotide sequence shown in SEQ ID NO:4. The sequence of SEQ ID
NO:4 corresponds to the full length hVR-2 encoding cDNA.

In another embodiment, an isolated nucleic acid molecule of the invention
comprises the nucleotide sequence shown in SEQ ID NO:7. The sequence of SEQ ID
NO:7 corresponds to a fragment of the hVR-2 (alternate form) encoding cDNA.

In another embodiment, an isolated nucleic acid molecule of the invention
comprises the nucleotide sequence shown in SEQ ID NO:10. The sequence of SEQ ID
NO:10 corresponds to a fragment of the rVR-2 cDNA.

In another embodiment, an isolated nucleic acid molecule of the invention
comprises a nucleic acid molecule which is a complement of the nucleotide sequence
shown in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide
sequences. A nucleic acid molecule which is complementary to the nucleotide sequence
shown in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 is one which is sufficiently
complementary to the nucleotide sequence shown in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or
12 such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:1, 3, 4, 6,
7, 9, 10, or 12, thereby forming a stable duplex.
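
As a rough illustration of what "sufficiently complementary" can mean at the sequence level, the fraction of Watson-Crick base pairs formed between two equal-length DNA strands aligned antiparallel can be computed as sketched below. The function name and the equal-length, gap-free comparison are assumptions made for the example; the specification itself does not set a numerical cutoff, and whether a stable duplex actually forms also depends on the hybridization conditions.

```python
# Watson-Crick pairs for DNA; any other combination counts as a mismatch.
_WATSON_CRICK = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}


def complementarity(strand: str, other: str) -> float:
    """Fraction of positions that base-pair when `other` is read antiparallel.

    Both strands are written 5'->3'. Position i of `strand` is compared with
    position i counted from the 3' end of `other`.
    """
    if len(strand) != len(other):
        raise ValueError("strands must have equal length")
    if not strand:
        raise ValueError("strands must not be empty")
    antiparallel = other.upper()[::-1]
    paired = sum((a, b) in _WATSON_CRICK
                 for a, b in zip(strand.upper(), antiparallel))
    return paired / len(strand)
```

A value of 1.0 indicates a perfect complement; lower values indicate mismatches that may or may not be tolerated depending on stringency.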

In still another embodiment, an isolated nucleic acid molecule of the present
invention comprises a nucleotide sequence which is at least about 60%, 65%, 70%, 75%,
80%, 83%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%,
98%, 99% or more homologous to the entire length of the nucleotide sequence shown in
SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide sequences.

Moreover, the nucleic acid molecule of the invention can comprise only a portion
of the nucleic acid sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, for example, a
fragment which can be used as a probe or primer or a fragment encoding a portion of an
hVR-1, hVR-2, and rVR-2 protein, e.g., a biologically active portion of an hVR-1,
hVR-2, and rVR-2 protein. The nucleotide sequence determined from the cloning of the
hVR-1, hVR-2, and rVR-2 gene allows for the generation of probes and primers



WO 00/29577 PCT/US99/26701


designed for use in identifying and/or cloning other hVR-1, hVR-2, and rVR-2 family
members, as well as hVR-1, hVR-2, and rVR-2 homologues from other species. The
probe/primer typically comprises a substantially purified oligonucleotide. The
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes
under stringent conditions to at least about 12 or 15, preferably about 20 or 25, more
preferably about 30, 35, 40, 45, 50, 55, 60, 65, 75, or 100 consecutive nucleotides of a
sense sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, of an anti-sense sequence of
SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, or of a naturally occurring allelic variant or mutant
of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12. In an exemplary embodiment, a nucleic acid
molecule of the present invention comprises a nucleotide sequence which is greater than
100-150, 150-200, 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 500-550,
550-600, 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 950-1000,
1088, or more nucleotides in length and hybridizes under stringent hybridization
conditions to a nucleic acid molecule of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12.

Probes based on the hVR-1, hVR-2, and rVR-2 nucleotide sequences can be used
to detect transcripts or genomic sequences encoding the same or homologous proteins.
In preferred embodiments, the probe further comprises a label group attached thereto,
e.g., the label group can be a radioisotope, a fluorescent compound, an enzyme, or an
enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for
identifying cells or tissue which misexpress an hVR-1, hVR-2, and rVR-2 protein, such
as by measuring a level of an hVR-1, hVR-2, and rVR-2-encoding nucleic acid in a
sample of cells from a subject, e.g., detecting hVR-1, hVR-2, and rVR-2 mRNA levels or
determining whether a genomic hVR-1, hVR-2, and rVR-2 gene has been mutated or
deleted.

A nucleic acid fragment encoding a "biologically active portion of an hVR-1,
hVR-2, and rVR-2 protein" can be prepared by isolating a portion of the nucleotide
sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 which encodes a polypeptide having
an hVR-1, hVR-2, and rVR-2 biological activity (the biological activities of the hVR-1,
hVR-2, and rVR-2 proteins are described herein), expressing the encoded portion of the
hVR-1, hVR-2, and rVR-2 protein (e.g., by recombinant expression in vitro) and
assessing the activity of the encoded portion of the hVR-1, hVR-2, and rVR-2 protein.

The invention further encompasses nucleic acid molecules that differ from the
nucleotide sequence shown in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 due to degeneracy
of the genetic code and thus encode the same hVR-1, hVR-2, and rVR-2 proteins as
those encoded by the nucleotide sequence shown in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or
12. In another embodiment, an isolated nucleic acid molecule of the invention has a
nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ
ID NO:2, 5, 8, or 11.
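
The effect of degeneracy of the genetic code can be illustrated with a short sketch that translates two different nucleotide sequences using the standard genetic code and confirms that they encode the same peptide. The two 12-nucleotide sequences are hypothetical and are not taken from any SEQ ID NO of this specification; the helper names are likewise assumptions made for the example.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(coding_dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = coding_dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# Hypothetical sequences that differ at the nucleotide level (CTG vs TTA for
# leucine, AAA vs AAG for lysine, TAA vs TGA as stop) but encode one peptide.
variant_1 = "ATGCTGAAATAA"
variant_2 = "ATGTTAAAGTGA"
same_protein = translate(variant_1) == translate(variant_2)
```

Here `same_protein` evaluates to true although the two sequences differ at four nucleotide positions.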

In addition to the hVR-1, hVR-2, and rVR-2 nucleotide sequences shown in SEQ
ID NO:1, 3, 4, 6, 7, 9, 10, or 12, it will be appreciated by those skilled in the art that
DNA sequence polymorphisms that lead to changes in the amino acid sequences of the
hVR-1, hVR-2, and rVR-2 proteins may exist within a population (e.g., the human
population). Such genetic polymorphism in the hVR-1, hVR-2, and rVR-2 genes may
exist among individuals within a population due to natural allelic variation. As used
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules which
include an open reading frame encoding an hVR-1, hVR-2, and rVR-2 protein,
preferably a mammalian hVR-1, hVR-2, and rVR-2 protein, and can further include
non-coding regulatory sequences, and introns.

Allelic variants of hVR-1, hVR-2, and rVR-2 include both functional and
non-functional hVR-1, hVR-2, and rVR-2 proteins. Functional allelic variants are naturally
occurring amino acid sequence variants of the hVR-1, hVR-2, and rVR-2 protein that
maintain the ability to bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain
signaling mechanism. Functional allelic variants will typically contain only
conservative substitution of one or more amino acids of SEQ ID NO:2, 5, 8, or 11, or
substitution, deletion or insertion of non-critical residues in non-critical regions of the
protein.

Non-functional allelic variants are naturally occurring amino acid sequence
variants of the hVR-1, hVR-2, and rVR-2 protein that do not have the ability to
bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain signaling mechanism.
Non-functional allelic variants will typically contain a non-conservative substitution, a
deletion, an insertion or a premature truncation of the amino acid sequence of SEQ ID
NO:2, 5, 8, or 11, or a substitution, insertion or deletion in critical residues or critical
regions.

The present invention further provides non-human orthologues of the hVR-2 and
rVR-2 protein. Orthologues of the hVR-2 and rVR-2 protein are proteins that are
isolated from non-human and non-rat organisms and possess the same hVR-2 and
rVR-2 ligand binding and/or modulation of pain signaling mechanism capabilities of the
hVR-2 and rVR-2 proteins. Orthologues of the hVR-2 and rVR-2 proteins can readily
be identified as comprising an amino acid sequence that is substantially homologous to
SEQ ID NO:4, 6, 8 or 10.

Moreover, nucleic acid molecules encoding other hVR-1, hVR-2, and rVR-2
family members and, thus, which have a nucleotide sequence which differs from the
hVR-1, hVR-2, and rVR-2 sequences of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, are
intended to be within the scope of the invention. For example, another hVR-1, hVR-2,
and rVR-2 cDNA can be identified based on the nucleotide sequence of hVR-1, hVR-2,
and rVR-2. Moreover, nucleic acid molecules encoding VR-2 proteins from different
species, and which, thus, have a nucleotide sequence which differs from the hVR-2 and
rVR-2 sequences of SEQ ID NO:4, 6, 8, or 10, are intended to be within the scope of the
invention. For example, a mouse VR-2 cDNA can be identified based on the
nucleotide sequence of the human VR-2 (hVR-2) or the rat VR-2 (rVR-2).

Nucleic acid molecules corresponding to natural allelic variants and homologues
of the hVR-1, hVR-2, and rVR-2 cDNAs of the invention can be isolated based on their
homology to the hVR-1, hVR-2, and rVR-2 nucleic acids disclosed herein using the
cDNAs disclosed herein, or a portion thereof, as a hybridization probe according to
standard hybridization techniques under stringent hybridization conditions. Nucleic acid
molecules corresponding to natural allelic variants and homologues of the hVR-1,
hVR-2, and rVR-2 cDNAs of the invention can further be isolated by mapping to the same
chromosome or locus as the hVR-1, hVR-2, and rVR-2 gene.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 15, 20, 25, 30 or more nucleotides in length and hybridizes under
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of
SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12. In another embodiment, the nucleic acid is at least
30, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850,
900, or 950 nucleotides in length. As used herein, the term "hybridizes under stringent
conditions" is intended to describe conditions for hybridization and washing under
which nucleotide sequences at least 60% identical to each other typically remain
hybridized to each other. Preferably, the conditions are such that sequences at least
about 70%, more preferably at least about 80%, even more preferably at least about 85%
or 90% identical to each other typically remain hybridized to each other. Such stringent
conditions are known to those skilled in the art and can be found in Current Protocols in
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred,
non-limiting example of stringent hybridization conditions is hybridization in 6X sodium
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X
SSC, 0.1% SDS at 50°C, preferably at 55°C, more preferably at 60°C, and even more
preferably at 65°C. Preferably, an isolated nucleic acid molecule of the invention that
hybridizes under stringent conditions to the sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10,
or 12 corresponds to a naturally-occurring nucleic acid molecule. As used herein, a
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having
a nucleotide sequence that occurs in nature (e.g., encodes a natural protein).
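
For orientation, the exemplary stringent conditions recited above can be written out as a small data structure, with the SSC strengths converted to salt concentrations. This is only a sketch: the class and function names are hypothetical, and the conversion relies on the commonly used composition of 1X SSC (150 mM sodium chloride, 15 mM sodium citrate), which is stated here as an assumption because the specification does not recite it. No SDS concentration is recorded for the hybridization step because none is given in the text.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumed composition of 1X SSC (20X SSC being 3 M NaCl, 0.3 M sodium citrate).
NACL_MM_PER_X = 150.0
CITRATE_MM_PER_X = 15.0


@dataclass(frozen=True)
class Step:
    name: str
    ssc_x: float                   # SSC strength, e.g. 6.0 for "6X SSC"
    sds_percent: Optional[float]   # None where the text gives no value
    temp_c: float

    def salt_mm(self) -> Tuple[float, float]:
        """(NaCl, sodium citrate) concentrations in millimolar."""
        return (round(self.ssc_x * NACL_MM_PER_X, 3),
                round(self.ssc_x * CITRATE_MM_PER_X, 3))


def example_stringent_protocol(wash_temp_c: float = 65.0) -> List[Step]:
    """Hybridization at about 45 C in 6X SSC, then a 0.2X SSC, 0.1% SDS wash."""
    if wash_temp_c not in (50.0, 55.0, 60.0, 65.0):
        raise ValueError("wash temperature must be 50, 55, 60 or 65 degrees C")
    return [
        Step("hybridization", 6.0, None, 45.0),
        Step("wash", 0.2, 0.1, wash_temp_c),
    ]
```

Lower salt and higher wash temperature both increase stringency, which is why the 65°C wash is the most preferred of the recited temperatures.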

In addition to naturally-occurring allelic variants of the hVR-1, hVR-2, and rVR-2
sequences that may exist in the population, the skilled artisan will further appreciate that
changes can be introduced by mutation into the nucleotide sequences of SEQ ID NO:1, 3,
4, 6, 7, 9, 10, or 12, thereby leading to changes in the amino acid sequence of the encoded
hVR-1, hVR-2, and rVR-2 proteins, without altering the functional ability of the hVR-1,
hVR-2, and rVR-2 proteins. For example, nucleotide substitutions leading to amino acid
substitutions at "non-essential" amino acid residues can be made in the sequence of SEQ
ID NO:1, 3, 4, 6, 7, 9, 10, or 12. A "non-essential" amino acid residue is a residue that
can be altered from the wild-type sequence of hVR-1, hVR-2, and rVR-2 (e.g., the
sequence of SEQ ID NO:2, 5, 8, or 11) without altering the biological activity, whereas
an "essential" amino acid residue is required for biological activity. For example, amino
acid residues that are conserved among the hVR-1, hVR-2, and rVR-2 proteins of the
present invention are predicted to be particularly unamenable to alteration. Furthermore,
additional amino acid residues that are conserved between the hVR-1, hVR-2, and rVR-2
proteins of the present invention and other members of the Capsaicin/Vanilloid receptor
family are not likely to be amenable to alteration.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding hVR-1, hVR-2, and rVR-2 proteins that contain changes in amino acid residues
that are not essential for activity. Such hVR-1, hVR-2, and rVR-2 proteins differ in
amino acid sequence from SEQ ID NO:2, 5, 8, or 11, yet retain biological activity. In
one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence
encoding a protein, wherein the protein comprises an amino acid sequence at least about
60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to SEQ
ID NO:2, 5, 8, or 11.

An isolated nucleic acid molecule encoding an hVR-1, hVR-2, and rVR-2 protein
homologous to the protein of SEQ ID NO:2, 5, 8, or 11 can be created by introducing one
or more nucleotide substitutions, additions or deletions into the nucleotide sequence of
SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12, such that one or more amino acid substitutions,
additions or deletions are introduced into the encoded protein. Mutations can be
introduced into SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 by standard techniques, such as
site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative
amino acid substitutions are made at one or more predicted non-essential amino acid
residues. A "conservative amino acid substitution" is one in which the amino acid
residue is replaced with an amino acid residue having a similar side chain. Families of
amino acid residues having similar side chains have been defined in the art. These
families include amino acids with basic side chains (e.g., lysine, arginine, histidine),
acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g.,
glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains
(e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan),
beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains
(e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential
amino acid residue in an hVR-1, hVR-2, and rVR-2 protein is preferably replaced with
another amino acid residue from the same side chain family. Alternatively, in another
embodiment, mutations can be introduced randomly along all or part of an hVR-1,
hVR-2, and rVR-2 coding sequence, such as by saturation mutagenesis, and the resultant
mutants can be screened for hVR-1, hVR-2, and rVR-2 biological activity to identify
mutants that retain activity. Following mutagenesis of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or
12, the encoded protein can be expressed recombinantly and its activity determined.
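
The side chain families listed above lend themselves to a simple lookup. The sketch below encodes exactly those families in one-letter amino acid code and reports whether a proposed substitution stays within at least one family. The names `SIDE_CHAIN_FAMILIES` and `is_conservative` are hypothetical, and treating "shares any one family" as the test is an assumption made for the example, since several residues (e.g., threonine, valine, histidine) belong to more than one family.

```python
# Families as listed in the text, in one-letter amino acid code.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),              # lysine, arginine, histidine
    "acidic": set("DE"),              # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),      # threonine, valine, isoleucine
    "aromatic": set("YFWH"),          # tyrosine, phenylalanine, tryptophan, histidine
}


def shared_families(original: str, replacement: str) -> list:
    """Names of the families that contain both residues."""
    a, b = original.upper(), replacement.upper()
    return [name for name, members in SIDE_CHAIN_FAMILIES.items()
            if a in members and b in members]


def is_conservative(original: str, replacement: str) -> bool:
    """True when the two residues differ and share at least one family."""
    if original.upper() == replacement.upper():
        return False  # not a substitution at all
    return bool(shared_families(original, replacement))
```

A check of this kind only classifies the substitution; whether the residue is in fact non-essential still has to be established by assaying the mutant protein.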

In one embodiment, a mutant hVR-1, hVR-2, and rVR-2 protein can be assayed for
the ability to (1) interact with a non-hVR-1, non-hVR-2, or non-rVR-2 protein molecule,
e.g., a vanilloid compound such as capsaicin; (2) modulate intracellular calcium
concentration; (3) activate an hVR-1, hVR-2, and rVR-2-dependent signal transduction
pathway; or (4) modulate a pain signaling mechanism.

In addition to the nucleic acid molecules encoding hVR-1, hVR-2, and rVR-2
proteins described above, another aspect of the invention pertains to isolated nucleic acid
molecules which are antisense thereto. An "antisense" nucleic acid comprises a
nucleotide sequence which is complementary to a "sense" nucleic acid encoding a
protein, e.g., complementary to the coding strand of a double-stranded cDNA molecule or
complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can
hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary
to an entire hVR-1, hVR-2, and rVR-2 coding strand, or to only a portion thereof. In one
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the
coding strand of a nucleotide sequence encoding hVR-1, hVR-2, and rVR-2. The term
"coding region" refers to the region of the nucleotide sequence comprising codons which
are translated into amino acid residues (e.g., the coding region of hVR-1, hVR-2, and
rVR-2). In another embodiment, the antisense nucleic acid molecule is antisense to a
"noncoding region" of the coding strand of a nucleotide sequence encoding hVR-1,
hVR-2, and rVR-2. The term "noncoding region" refers to 5' and 3' sequences which flank the
coding region that are not translated into amino acids (i.e., also referred to as 5' and 3'
untranslated regions).

Given the coding strand sequences encoding hVR-1, hVR-2, and rVR-2 disclosed
herein, antisense nucleic acids of the invention can be designed according to the rules of
Watson and Crick base pairing. The antisense nucleic acid molecule can be
complementary to the entire coding region of hVR-1, hVR-2, and rVR-2 mRNA, but
more preferably is an oligonucleotide which is antisense to only a portion of the coding
or noncoding region of hVR-1, hVR-2, and rVR-2 mRNA. For example, the antisense
oligonucleotide can be complementary to the region surrounding the translation start site
of hVR-1, hVR-2, and rVR-2 mRNA. An antisense oligonucleotide can be, for example,
about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic
acid of the invention can be constructed using chemical synthesis and enzymatic ligation
reactions using procedures known in the art. For example, an antisense nucleic acid (e.g.,
an antisense oligonucleotide) can be chemically synthesized using naturally occurring
nucleotides or variously modified nucleotides designed to increase the biological stability
of the molecules or to increase the physical stability of the duplex formed between the
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine
substituted nucleotides can be used. Examples of modified nucleotides which can be
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil,
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively,
the antisense nucleic acid can be produced biologically using an expression vector into
which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA
transcribed from the inserted nucleic acid will be of an antisense orientation to a target
nucleic acid of interest, described further in the following subsection).
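
The design rule described above, i.e., an oligonucleotide complementary to the region surrounding the translation start site, can be sketched as follows. The example is illustrative only: the function names, the default window (5 nucleotides upstream, 12 nucleotides total), and the simplification that the first ATG in the supplied sense sequence is the translation start are all assumptions, and the short test sequence in the usage note is hypothetical rather than taken from any SEQ ID NO herein.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Watson-Crick reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def antisense_around_start(sense_cdna: str, upstream: int = 5,
                           length: int = 12) -> str:
    """Antisense DNA oligonucleotide (5'->3') spanning the start codon.

    Assumption: the first ATG found in `sense_cdna` is the translation
    start site; a real design would use the annotated coding region.
    """
    sense = sense_cdna.upper()
    start_codon = sense.find("ATG")
    if start_codon < 0:
        raise ValueError("no ATG start codon found")
    window_start = max(0, start_codon - upstream)
    target = sense[window_start:window_start + length]
    if len(target) < length:
        raise ValueError("sequence too short for the requested length")
    return reverse_complement(target)
```

For example, `antisense_around_start("GGCACCATGGCTAGC")` targets the sense window `GCACCATGGCTA` and returns its reverse complement; the result necessarily contains `CAT`, the antisense counterpart of the `ATG` start codon.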

The antisense nucleic acid molecules of the invention are typically administered
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA
and/or genomic DNA encoding an hVR-1, hVR-2, and rVR-2 protein to thereby inhibit
expression of the protein, e.g., by inhibiting transcription and/or translation. The
hybridization can be by conventional nucleotide complementarity to form a stable
duplex, or, for example, in the case of an antisense nucleic acid molecule which binds to
DNA duplexes, through specific interactions in the major groove of the double helix.
An example of a route of administration of antisense nucleic acid molecules of the
invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid
molecules can be modified to target selected cells and then administered systemically.
For example, for systemic administration, antisense molecules can be modified such that
they specifically bind to receptors or antigens expressed on a selected cell surface, e.g.,
by linking the antisense nucleic acid molecules to peptides or antibodies which bind to
cell surface receptors or antigens. The antisense nucleic acid molecules can also be
delivered to cells using the vectors described herein. To achieve sufficient intracellular
concentrations of the antisense molecules, vector constructs in which the antisense
nucleic acid molecule is placed under the control of a strong pol II or pol III promoter
are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention
is an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms
specific double-stranded hybrids with complementary RNA in which, contrary to the
usual β-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a
2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330).

In still another embodiment, an antisense nucleic acid of the invention is a
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to
catalytically cleave hVR-1, hVR-2, and rVR-2 mRNA transcripts to thereby inhibit
translation of hVR-1, hVR-2, and rVR-2 mRNA. A ribozyme having specificity for an
hVR-1, hVR-2, and rVR-2-encoding nucleic acid can be designed based upon the
nucleotide sequence of an hVR-1, hVR-2, and rVR-2 cDNA disclosed herein (i.e., SEQ
ID NO:1, 3, 4, 6, 7, 9, 10, or 12). For example, a derivative of a Tetrahymena L-19 IVS
RNA can be constructed in which the nucleotide sequence of the active site is
complementary to the nucleotide sequence to be cleaved in an hVR-1, hVR-2, and
rVR-2-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al.
U.S. Patent No. 5,116,742. Alternatively, hVR-1, hVR-2, and rVR-2 mRNA can be
used to select a catalytic RNA having a specific ribonuclease activity from a pool of
RNA molecules. See, for example, Bartel, D. and Szostak, J.W. (1993) Science
261:1411-1418.

Alternatively, hVR-1, hVR-2, and rVR-2 gene expression can be inhibited by
targeting nucleotide sequences complementary to the regulatory region of the hVR-1,
hVR-2, and rVR-2 (e.g., the hVR-1, hVR-2, and rVR-2 promoter and/or enhancers) to
form triple helical structures that prevent transcription of the hVR-1, hVR-2, and rVR-2
gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569-84;
Helene, C. et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J. (1992)
Bioessays 14(12):807-15.

In yet another embodiment, the hVR-1, hVR-2, and rVR-2 nucleic acid
molecules of the present invention can be modified at the base moiety, sugar moiety or
phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the
molecule. For example, the deoxyribose phosphate backbone of the nucleic acid
molecules can be modified to generate peptide nucleic acids (see Hyrup B. et al. (1996)
Bioorganic & Medicinal Chemistry 4(1):5-23). As used herein, the terms "peptide
nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the
deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the
four natural nucleobases are retained. The neutral backbone of PNAs has been shown to
allow for specific hybridization to DNA and RNA under conditions of low ionic
strength. The synthesis of PNA oligomers can be performed using standard solid phase
peptide synthesis protocols as described in Hyrup B. et al. (1996) supra; Perry-O'Keefe
et al. Proc. Natl. Acad. Sci. 93:14670-675.

PNAs of hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be used in
therapeutic and diagnostic applications. For example, PNAs can be used as antisense or
antigene agents for sequence-specific modulation of gene expression by, for example,
inducing transcription or translation arrest or inhibiting replication. PNAs of hVR-1,
hVR-2, and rVR-2 nucleic acid molecules can also be used in the analysis of single base
pair mutations in a gene (e.g., by PNA-directed PCR clamping); as 'artificial restriction
enzymes' when used in combination with other enzymes (e.g., S1 nucleases (Hyrup B.
(1996) supra)); or as probes or primers for DNA sequencing or hybridization (Hyrup B.
et al. (1996) supra; Perry-O'Keefe supra).

In another embodiment, PNAs of hVR-1, hVR-2, and rVR-2 can be modified
(e.g., to enhance their stability or cellular uptake) by attaching lipophilic or other helper
groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or
other techniques of drug delivery known in the art. For example, PNA-DNA chimeras
of hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be generated which may
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA
recognition enzymes (e.g., RNAse H and DNA polymerases) to interact with the DNA
portion while the PNA portion would provide high binding affinity and specificity.
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup B.
(1996) supra). The synthesis of PNA-DNA chimeras can be performed as described in
Hyrup B. (1996) supra and Finn P.J. et al. (1996) Nucleic Acids Res. 24(17):3357-63.
For example, a DNA chain can be synthesized on a solid support using standard
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g.,
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used as a link
between the PNA and the 5' end of DNA (Mag, M. et al. (1989) Nucleic Acids Res.
17:5973-88). PNA monomers are then coupled in a stepwise manner to produce a chimeric
molecule with a 5' PNA segment and a 3' DNA segment (Finn P.J. et al. (1996) supra).
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3'
PNA segment (Petersen, K.H. et al. (1995) Bioorganic Med. Chem. Lett. 5:1119-1124).

In other embodiments, the oligonucleotide may include other appended groups
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad.
Sci. USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652;
PCT Publication No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication
No. WO89/10134). In addition, oligonucleotides can be modified with
hybridization-triggered cleavage agents (see, e.g., Krol et al. (1988) Bio-Techniques
6:958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5:539-549). To this
end, the oligonucleotide may be conjugated to another molecule (e.g., a peptide,
hybridization triggered cross-linking agent, transport agent, or hybridization-triggered
cleavage agent).

II. Isolated hVR-1, hVR-2, and rVR-2 Proteins and Anti-hVR-1, Anti-hVR-2, and
Anti-rVR-2 Antibodies

One aspect of the invention pertains to isolated hVR-1, hVR-2, and rVR-2
proteins, and biologically active portions thereof, as well as polypeptide fragments
suitable for use as immunogens to raise anti-hVR-1, anti-hVR-2, and anti-rVR-2
antibodies. In one embodiment, native hVR-1, hVR-2, and rVR-2 proteins can be
isolated from cells or tissue sources by an appropriate purification scheme using
standard protein purification techniques. In another embodiment, hVR-1, hVR-2, and
rVR-2 proteins are produced by recombinant DNA techniques. As an alternative to
recombinant expression, an hVR-1, hVR-2, and rVR-2 protein or polypeptide can be
synthesized chemically using standard peptide synthesis techniques.

An "isolated" or "purified" protein or biologically active portion thereof is
substantially free of cellular material or other contaminating proteins from the cell or
tissue source from which the hVR-1, hVR-2, and rVR-2 protein is derived, or
substantially free from chemical precursors or other chemicals when chemically
synthesized. The language "substantially free of cellular material" includes preparations
of hVR-1, hVR-2, and rVR-2 protein in which the protein is separated from cellular
components of the cells from which it is isolated or recombinantly produced. In one
embodiment, the language "substantially free of cellular material" includes preparations
of hVR-1, hVR-2, and rVR-2 protein having less than about 30% (by dry weight) of
non-hVR-1, hVR-2, and rVR-2 protein (also referred to herein as a "contaminating
protein"), more preferably less than about 20% of non-hVR-1, hVR-2, and rVR-2
protein, still more preferably less than about 10% of non-hVR-1, hVR-2, and rVR-2
protein, and most preferably less than about 5% non-hVR-1, non-hVR-2, and non-rVR-2
protein. When the hVR-1, hVR-2, and rVR-2 protein or biologically active portion
thereof is recombinantly produced, it is also preferably substantially free of culture
medium, i.e., culture medium represents less than about 20%, more preferably less than
about 10%, and most preferably less than about 5% of the volume of the protein
preparation.
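
As an illustration only, the nested dry-weight limits for contaminating protein recited above (30%, 20%, 10%, and 5%) can be expressed as a small Python check. The function name, its arguments, and the example masses are hypothetical. The word "about" in the text is not modeled, so each limit is treated as a strict cut-off.

```python
# Illustrative sketch; the names and the strict cut-offs are assumptions.
# Limits are the dry-weight percentages recited in the text, strictest first.
CONTAMINANT_LIMITS_PERCENT = (5.0, 10.0, 20.0, 30.0)


def strictest_limit_met(contaminant_mass: float, total_mass: float):
    """Return the strictest recited limit (in percent) that the preparation meets.

    Returns None when contaminating protein is 30% or more of the dry weight.
    """
    if total_mass <= 0:
        raise ValueError("total_mass must be positive")
    percent = 100.0 * contaminant_mass / total_mass
    for limit in CONTAMINANT_LIMITS_PERCENT:
        if percent < limit:
            return limit
    return None


if __name__ == "__main__":
    # 0.8 mg contaminating protein in a 10 mg preparation (made-up values).
    print(strictest_limit_met(0.8, 10.0))
```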

The language "substantially free of chemical precursors or other chemicals"
includes preparations of hVR-1, hVR-2, and rVR-2 protein in which the protein is
separated from chemical precursors or other chemicals which are involved in the
synthesis of the protein. In one embodiment, the language "substantially free of
chemical precursors or other chemicals" includes preparations of hVR-1, hVR-2, and
rVR-2 protein having less than about 30% (by dry weight) of chemical precursors or
non-hVR-1, hVR-2, and rVR-2 chemicals, more preferably less than about 20%
chemical precursors or non-hVR-1, hVR-2, and rVR-2 chemicals, still more preferably
less than about 10% chemical precursors or non-hVR-1, hVR-2, and rVR-2 chemicals,
and most preferably less than about 5% chemical precursors or non-hVR-1, hVR-2, and
rVR-2 chemicals.

As used herein, a "biologically active portion" of an hVR-1, hVR-2, and rVR-2
protein includes a fragment of an hVR-1, hVR-2, and rVR-2 protein which participates
in an interaction between an hVR-1, hVR-2, and rVR-2 molecule and a non-hVR-1,
non-hVR-2, and non-rVR-2 molecule, respectively. Biologically active portions of an
hVR-1, hVR-2, and rVR-2 protein include peptides comprising amino acid sequences

In one embodiment, a biologically active portion of an hVR-1, hVR-2, and rVR-2
protein comprises at least one transmembrane domain, and/or at least one proline rich
domain, and/or at least one ankyrin repeat domain. It is to be understood that a
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein of the present
invention may contain at least one of the above-identified structural domains. In another
embodiment, a biologically active portion of an hVR-1, hVR-2, and rVR-2 protein may
contain at least two of the above-identified structural domains. Moreover, other
biologically active portions, in which other regions of the protein are deleted, can be
prepared by recombinant techniques and evaluated for one or more of the functional
activities of a native hVR-1, hVR-2, and rVR-2 protein.

In one embodiment, the hVR-1, hVR-2, and rVR-2 protein has an amino acid
sequence shown in SEQ ID NO:2, 5, 8, or 11. In other embodiments, the hVR-1, hVR-2,
and rVR-2 protein is substantially homologous to SEQ ID NO:2, 5, 8, or 11, and
retains the functional activity of the protein of SEQ ID NO:2, 5, 8, or 11, yet differs in
amino acid sequence due to natural allelic variation or mutagenesis, as described in
detail in subsection I above. Accordingly, in another embodiment, the hVR-1, hVR-2,
and rVR-2 protein is a protein which comprises an amino acid sequence at least about
60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to SEQ
ID NO:2, 5, 8, or 11.

To determine the percent identity of two amino acid sequences or of two nucleic
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps
can be introduced in one or both of a first and a second amino acid or nucleic acid
sequence for optimal alignment and non-homologous sequences can be disregarded for
comparison purposes). In one embodiment, the length of a reference sequence aligned for
comparison purposes is at least 30%, preferably at least 40%, more preferably at least
50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or
90% of the length of the reference sequence (e.g., when aligning a second sequence to
the hVR-1, hVR-2, and rVR-2 amino acid sequence of SEQ ID NO:2, 5, 8, or 11, having
177 amino acid residues, at least 80, preferably at least 100, more preferably at least 120,
even more preferably at least 140, and even more preferably at least 150, 160 or 170
amino acid residues are aligned). The amino acid residues or nucleotides at
corresponding amino acid positions or nucleotide positions are then compared. When a
position in the first sequence is occupied by the same amino acid residue or nucleotide
as the corresponding position in the second sequence, then the molecules are identical at
that position (as used herein amino acid or nucleic acid "identity" is equivalent to amino
acid or nucleic acid "homology"). The percent identity between the two sequences is a
function of the number of identical positions shared by the sequences, taking into
account the number of gaps, and the length of each gap, which need to be introduced for
optimal alignment of the two sequences.
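
The position-by-position comparison described above can be sketched in Python as follows. The text does not fix an exact formula, so this sketch assumes one common convention: identical positions divided by the full alignment length, with gapped columns counted in the denominator. The function name, the gap character, and the example alignment are hypothetical.

```python
# Minimal sketch under the stated assumptions; not the definitive formula.
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two already-aligned, equal-length sequences.

    Gapped columns count toward the alignment length, so every introduced
    gap position lowers the result.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Made-up six-column alignment with one gap in the first sequence.
    print(round(percent_identity("ACDE-G", "ACDEFG"), 1))
```

Other conventions divide by the length of the shorter sequence or by the number of ungapped columns; which denominator applies would need to be stated alongside any reported value.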

The comparison of sequences and determination of percent identity between two
sequences can be accomplished using a mathematical algorithm. In one embodiment, the
percent identity between two amino acid sequences is determined using the Needleman
and Wunsch (J. Mol. Biol. (48):444-453 (1970)) algorithm which has been incorporated
into the GAP program in the GCG software package (available at http://www.gcg.com),
using either a Blosum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12,
10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another embodiment, the
percent identity between two nucleotide sequences is determined using the GAP
program in the GCG software package (available at http://www.gcg.com), using a
NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight
of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two amino
acid or nucleotide sequences is determined using the algorithm of E. Meyers and W.
Miller (CABIOS, 4:11-17 (1989)) which has been incorporated into the ALIGN program
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a
gap penalty of 4.
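
For orientation, a minimal Python sketch of Needleman-Wunsch global alignment follows. It is not the GAP program: instead of a Blosum 62 or PAM250 matrix with separate gap and length weights, it assumes a flat match/mismatch score and a single linear gap penalty. The function name, the default scores, and the example sequences are hypothetical.

```python
# Simplified sketch: flat match/mismatch scores and a linear gap penalty
# stand in for the substitution matrices and gap/length weights in the text.
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1,
                     gap: int = -2):
    """Return (score, aligned_a, aligned_b) for a global alignment of a and b."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(needleman_wunsch("ACGT", "AGT"))
```

Swapping the flat scores for a substitution matrix lookup and adding an affine gap model would bring the sketch closer to the parameter choices listed above.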

The nucleic acid and protein sequences of the present invention can further be
used as a "query sequence" to perform a search against public databases to, for example,
identify other family members or related sequences. Such searches can be performed
using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J.
Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences
homologous to hVR-1, hVR-2, and rVR-2 nucleic acid molecules of the invention.
BLAST protein searches can be performed with the XBLAST program, score = 50,
wordlength = 3 to obtain amino acid sequences homologous to hVR-1, hVR-2, and rVR-2
protein molecules of the invention. To obtain gapped alignments for comparison
purposes, Gapped BLAST can be utilized as described in Altschul et al. (1997) Nucleic
Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs,
the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be
used. See http://www.ncbi.nlm.nih.gov.

The invention also provides hVR-1, hVR-2, and rVR-2 chimeric or fusion
proteins. As used herein, an hVR-1, hVR-2, and rVR-2 "chimeric protein" or "fusion
protein" comprises an hVR-1, hVR-2, and rVR-2 polypeptide operatively linked to a
non-hVR-1, hVR-2, and rVR-2 polypeptide. An "hVR-1, hVR-2, and rVR-2
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to
hVR-1, hVR-2, and rVR-2, whereas a "non-hVR-1, non-hVR-2, and non-rVR-2
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a
protein which is not substantially homologous to the hVR-1, hVR-2, and rVR-2 protein,
e.g., a protein which is different from the hVR-1, hVR-2, and rVR-2 protein and which
is derived from the same or a different organism. Within an hVR-1, hVR-2, and rVR-2
fusion protein the hVR-1, hVR-2, and rVR-2 polypeptide can correspond to all or a
portion of an hVR-1, hVR-2, and rVR-2 protein. In one embodiment, an hVR-1, hVR-2,
and rVR-2 fusion protein comprises at least one biologically active portion of an hVR-1,

In another embodiment, the fusion protein is an hVR-1, hVR-2, and rVR-2
protein containing a heterologous signal sequence at its N-terminus. In certain host cells
(e.g., mammalian host cells), expression and/or secretion of hVR-1, hVR-2, and rVR-2
can be increased through use of a heterologous signal sequence.

The hVR-1, hVR-2, and rVR-2 fusion proteins of the invention can be
incorporated into pharmaceutical compositions and administered to a subject in vivo.
The hVR-1, hVR-2, and rVR-2 fusion proteins can be used to affect the bioavailability
of an hVR-1, hVR-2, and rVR-2 substrate. Use of hVR-1, hVR-2, and rVR-2 fusion
proteins may be useful therapeutically for the treatment of disorders caused by, for
example, (i) aberrant modification or mutation of a gene encoding an hVR-1, hVR-2,
and rVR-2 protein; (ii) mis-regulation of the hVR-1, hVR-2, and rVR-2 gene; and (iii)
aberrant post-translational modification of an hVR-1, hVR-2, and rVR-2 protein.

Moreover, the hVR-1, hVR-2, and rVR-2 fusion proteins of the invention can be
used as immunogens to produce anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies in a
subject, to purify hVR-1, hVR-2, and rVR-2 ligands and in screening assays to identify
molecules which inhibit the interaction of hVR-1, hVR-2, and rVR-2 with an hVR-1,
hVR-2, and rVR-2 substrate.

Preferably, an hVR-1, hVR-2, and rVR-2 chimeric or fusion protein of the
invention is produced by standard recombinant DNA techniques. For example, DNA
fragments coding for the different polypeptide sequences are ligated together in-frame in
accordance with conventional techniques, for example by employing blunt-ended or
stagger-ended termini for ligation, restriction enzyme digestion to provide for
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment,
the fusion gene can be synthesized by conventional techniques including automated
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried
out using anchor primers which give rise to complementary overhangs between two
consecutive gene fragments which can subsequently be annealed and reamplified to
generate a chimeric gene sequence (see, for example, Current Protocols in Molecular
Biology, eds. Ausubel et al. John Wiley & Sons: 1992). Moreover, many expression
vectors are commercially available that already encode a fusion moiety (e.g., a GST
polypeptide). An hVR-1, hVR-2, and rVR-2-encoding nucleic acid can be cloned into
such an expression vector such that the fusion moiety is linked in-frame to the hVR-1,
hVR-2, and rVR-2 protein.

The present invention also pertains to variants of the hVR-1, hVR-2, and rVR-2
proteins which function as either hVR-1, hVR-2, and rVR-2 agonists (mimetics) or as
hVR-1, hVR-2, and rVR-2 antagonists. Variants of the hVR-1, hVR-2, and rVR-2
proteins can be generated by mutagenesis, e.g., discrete point mutation or truncation of
an hVR-1, hVR-2, and rVR-2 protein. An agonist of the hVR-1, hVR-2, and rVR-2
proteins can retain substantially the same, or a subset, of the biological activities of the
naturally occurring form of an hVR-1, hVR-2, and rVR-2 protein. An antagonist of an
hVR-1, hVR-2, and rVR-2 protein can inhibit one or more of the activities of the
naturally occurring form of the hVR-1, hVR-2, and rVR-2 protein by, for example,
competitively modulating an hVR-1, hVR-2, and rVR-2-mediated activity of an hVR-1,
hVR-2, and rVR-2 protein. Thus, specific biological effects can be elicited by treatment
with a variant of limited function. In one embodiment, treatment of a subject with a
variant having a subset of the biological activities of the naturally occurring form of the
protein has fewer side effects in a subject relative to treatment with the naturally
occurring form of the hVR-1, hVR-2, and rVR-2 protein.

In one embodiment, variants of an hVR-1, hVR-2, and rVR-2 protein which
function as either hVR-1, hVR-2, and rVR-2 agonists (mimetics) or as hVR-1, hVR-2,
and rVR-2 antagonists can be identified by screening combinatorial libraries of mutants,
e.g., truncation mutants, of an hVR-1, hVR-2, and rVR-2 protein for hVR-1, hVR-2, and
rVR-2 protein agonist or antagonist activity. In one embodiment, a variegated library of
hVR-1, hVR-2, and rVR-2 variants is generated by combinatorial mutagenesis at the
nucleic acid level and is encoded by a variegated gene library. A variegated library of
hVR-1, hVR-2, and rVR-2 variants can be produced by, for example, enzymatically
ligating a mixture of synthetic oligonucleotides into gene sequences such that a
degenerate set of potential hVR-1, hVR-2, and rVR-2 sequences is expressible as
individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage
display) containing the set of hVR-1, hVR-2, and rVR-2 sequences therein. There are a
variety of methods which can be used to produce libraries of potential hVR-1, hVR-2,
and rVR-2 variants from a degenerate oligonucleotide sequence. Chemical synthesis of
a degenerate gene sequence can be performed in an automatic DNA synthesizer, and the
synthetic gene then ligated into an appropriate expression vector. Use of a degenerate
set of genes allows for the provision, in one mixture, of all of the sequences encoding
the desired set of potential hVR-1, hVR-2, and rVR-2 sequences. Methods for
synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, S.A.
(1983) Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et
al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477).
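
To make the idea of a degenerate set concrete, the following Python sketch enumerates every specific sequence represented by a degenerate oligonucleotide written in the standard IUPAC nucleotide codes. The use of IUPAC notation, the function name, and the example oligonucleotides are assumptions for illustration; the text above does not prescribe a notation.

```python
# Illustrative sketch; the IUPAC notation and all names are assumptions.
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list:
    """List every specific sequence encoded by a degenerate oligonucleotide."""
    choices = (IUPAC[base] for base in oligo.upper())
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # "GCN" covers the four alanine codons; "NNK" is a made-up example
    # of a randomized codon.
    print(expand_degenerate("GCN"))
    print(len(expand_degenerate("NNK")))
```

The number of members grows multiplicatively with each degenerate position, which is why a single synthesis can supply a large set of coding sequences in one mixture.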

In addition, libraries of fragments of an hVR-1, hVR-2, and rVR-2 protein
coding sequence can be used to generate a variegated population of hVR-1, hVR-2, and
rVR-2 fragments for screening and subsequent selection of variants of an hVR-1, hVR-2,
and rVR-2 protein. In one embodiment, a library of coding sequence fragments can
be generated by treating a double stranded PCR fragment of an hVR-1, hVR-2, and
rVR-2 coding sequence with a nuclease under conditions wherein nicking occurs only
about once per molecule, denaturing the double stranded DNA, renaturing the DNA to
form double stranded DNA which can include sense/antisense pairs from different
nicked products, removing single stranded portions from reformed duplexes by
treatment with S1 nuclease, and ligating the resulting fragment library into an expression
vector. By this method, an expression library can be derived which encodes N-terminal,
C-terminal and internal fragments of various sizes of the hVR-1, hVR-2, and rVR-2
protein.
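
The three classes of fragments named above can be illustrated with a short Python sketch that enumerates them for a given sequence. This is a conceptual illustration only: it lists every contiguous fragment rather than simulating nuclease nicking, and the function name, the minimum length, and the example sequence are hypothetical.

```python
# Conceptual sketch; it enumerates fragments and does not model the
# nicking, renaturation, or S1 nuclease steps described in the text.
def enumerate_fragments(seq: str, min_len: int = 2) -> dict:
    """Group all contiguous fragments of seq (excluding full length) by class."""
    n = len(seq)
    fragments = {"n_terminal": [], "c_terminal": [], "internal": []}
    for start in range(n):
        for end in range(start + min_len, n + 1):
            if start == 0 and end == n:
                continue  # skip the full-length sequence itself
            fragment = seq[start:end]
            if start == 0:
                fragments["n_terminal"].append(fragment)
            elif end == n:
                fragments["c_terminal"].append(fragment)
            else:
                fragments["internal"].append(fragment)
    return fragments


if __name__ == "__main__":
    # Made-up five-residue sequence.
    print(enumerate_fragments("MKTAY", min_len=3))
```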

Several techniques are known in the art for screening gene products of
combinatorial libraries made by point mutations or truncation, and for screening cDNA
libraries for gene products having a selected property. Such techniques are adaptable for
rapid screening of the gene libraries generated by the combinatorial mutagenesis of
hVR-1, hVR-2, and rVR-2 proteins. The most widely used techniques, which are
amenable to high through-put analysis, for screening large gene libraries typically
include cloning the gene library into replicable expression vectors, transforming
appropriate cells with the resulting library of vectors, and expressing the combinatorial
genes under conditions in which detection of a desired activity facilitates isolation of the
vector encoding the gene whose product was detected. Recursive ensemble mutagenesis
(REM), a new technique which enhances the frequency of functional mutants in the
libraries, can be used in combination with the screening assays to identify hVR-1, hVR-2,
and rVR-2 variants (Arkin and Yourvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815;
Delgrave et al. (1993) Protein Engineering 6(3):327-331).
In one embodiment, cell based assays can be exploited to analyze a variegated
hVR-1, hVR-2, and rVR-2 library. For example, a library of expression vectors can be
transfected into a cell line, e.g., a neuronal cell line, which ordinarily responds to a
particular ligand in an hVR-1, hVR-2, and rVR-2-dependent manner. The transfected
cells are then contacted with the ligand and the effect of expression of the mutant on
signaling by the ligand can be detected, e.g., by measuring intracellular calcium
concentration, neuronal membrane depolarization, or the activity of an hVR-1, hVR-2,
and rVR-2-regulated transcription factor. Plasmid DNA can then be recovered from the
cells which score for inhibition, or alternatively, potentiation of signaling by the ligand,
and the individual clones further characterized.

An isolated hVR-1, hVR-2, and rVR-2 protein, or a portion or fragment thereof,
can be used as an immunogen to generate antibodies that bind hVR-1, hVR-2, and rVR-2
using standard techniques for polyclonal and monoclonal antibody preparation. A
full-length hVR-1, hVR-2, and rVR-2 protein can be used or, alternatively, the invention
provides antigenic peptide fragments of hVR-1, hVR-2, and rVR-2 for use as
immunogens. The antigenic peptide of hVR-1, hVR-2, and rVR-2 comprises at least 8
amino acid residues of the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 11 and
encompasses an epitope of hVR-1, hVR-2, and rVR-2 such that an antibody raised
against the peptide forms a specific immune complex with hVR-1, hVR-2, and rVR-2.
Preferably, the antigenic peptide comprises at least 10 amino acid residues, more
preferably at least 15 amino acid residues, even more preferably at least 20 amino acid
residues, and most preferably at least 30 amino acid residues.

Epitopes encompassed by the antigenic peptide are regions of hVR-1, hVR-2, 
and rVR-2 that are located on the surface of the protein, e.g., hydrophilic regions, as 
well as regions with high antigenicity (see, for example, Figures 12 and 14).

An hVR-1, hVR-2, and rVR-2 immunogen typically is used to prepare antibodies
by immunizing a suitable subject (e.g., rabbit, goat, mouse or other mammal) with the
immunogen. An appropriate immunogenic preparation can contain, for example,
recombinantly expressed hVR-1, hVR-2, and rVR-2 protein or a chemically synthesized
hVR-1, hVR-2, and rVR-2 polypeptide. The preparation can further include an
adjuvant, such as Freund's complete or incomplete adjuvant, or similar
immunostimulatory agent. Immunization of a suitable subject with an immunogenic
hVR-1, hVR-2, and rVR-2 preparation induces a polyclonal anti-hVR-1, anti-hVR-2,
and anti-rVR-2 antibody response.
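
The preference for hydrophilic, surface-exposed regions when choosing an antigenic peptide of at least 8 residues can be sketched with a sliding-window hydropathy scan. The sketch below assumes the Kyte-Doolittle hydropathy scale, which the text does not name; the function name, the window length default, and the test sequence are likewise hypothetical and unrelated to SEQ ID NO:2, 5, 8, or 11.

```python
# Illustrative sketch assuming the Kyte-Doolittle scale (lower = more
# hydrophilic). This is not the analysis used to produce the figures.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophilic_window(seq: str, window: int = 8):
    """Return (start, peptide, mean_hydropathy) for the most hydrophilic window.

    The first window wins when several windows tie.
    """
    if len(seq) < window:
        raise ValueError("sequence is shorter than the window")
    best_start, best_score = 0, None
    for start in range(len(seq) - window + 1):
        peptide = seq[start:start + window]
        mean = sum(KYTE_DOOLITTLE[aa] for aa in peptide) / window
        if best_score is None or mean < best_score:
            best_start, best_score = start, mean
    return best_start, seq[best_start:best_start + window], best_score


if __name__ == "__main__":
    # Made-up sequence: a lysine-rich stretch flanked by leucine runs.
    print(most_hydrophilic_window("LLLLLLLLKKKKKKKKLLLLLLLL"))
```

Hydropathy alone is a coarse proxy; a candidate selected this way would still need to be checked for actual surface exposure and antigenicity.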

Accordingly, another aspect of the invention pertains to anti-hVR-1, anti-hVR-2,
and anti-rVR-2 antibodies. The term "antibody" as used herein refers to
immunoglobulin molecules and immunologically active portions of immunoglobulin
molecules, i.e., molecules that contain an antigen binding site which specifically binds
(immunoreacts with) an antigen, such as hVR-1, hVR-2, and rVR-2. Examples of
immunologically active portions of immunoglobulin molecules include F(ab) and
F(ab')2 fragments which can be generated by treating the antibody with an enzyme such
as pepsin. The invention provides polyclonal and monoclonal antibodies that bind hVR-1,
hVR-2, and rVR-2. The term "monoclonal antibody" or "monoclonal antibody
composition", as used herein, refers to a population of antibody molecules that contain
only one species of an antigen binding site capable of immunoreacting with a particular
epitope of hVR-1, hVR-2, and rVR-2. A monoclonal antibody composition thus
typically displays a single binding affinity for a particular hVR-1, hVR-2, and rVR-2
protein with which it immunoreacts.

Polyclonal anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be prepared
as described above by immunizing a suitable subject with an hVR-1, hVR-2, and rVR-2
immunogen. The anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody titer in the
immunized subject can be monitored over time by standard techniques, such as with an
enzyme linked immunosorbent assay (ELISA) using immobilized hVR-1, hVR-2, and
rVR-2. If desired, the antibody molecules directed against hVR-1, hVR-2, and rVR-2
can be isolated from the mammal (e.g., from the blood) and further purified by well
known techniques, such as protein A chromatography to obtain the IgG fraction. At an
appropriate time after immunization, e.g., when the anti-hVR-1, anti-hVR-2, and
anti-rVR-2 antibody titers are highest, antibody-producing cells can be obtained from the
subject and used to prepare monoclonal antibodies by standard techniques, such as the
hybridoma technique originally described by Kohler and Milstein ((1975) Nature
256:495-497) (see also, Brown et al. (1981) J. Immunol. 127:539-46; Brown et al.
(1980) J. Biol. Chem. 255:4980-83; Yeh et al. (1976) Proc. Natl. Acad. Sci. USA
76:2927-31; and Yeh et al. (1982) Int. J. Cancer 29:269-75), the more recent human B
cell hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72), the
EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy,
Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol. Med., 54:387-402;
M. L. Gefter et al. (1977) Somatic Cell Genet. 3:231-36). Briefly, an immortal cell line
(typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal
immunized with an hVR-1, hVR-2, and rVR-2 immunogen as described above, and the
culture supernatants of the resulting hybridoma cells are screened to identify a
hybridoma producing a monoclonal antibody that binds hVR-1, hVR-2, and rVR-2.
Any of the many well known protocols used for fusing lymphocytes and
immortalized cell lines can be applied for the purpose of generating an anti-hVR-1,
anti-hVR-2, and anti-rVR-2 monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature
266:550-52; Gefter et al. Somatic Cell Genet., cited supra; Lerner, Yale J. Biol. Med.,
cited supra; Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinarily
skilled worker will appreciate that there are many variations of such methods which also
would be useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived
from the same mammalian species as the lymphocytes. For example, murine
hybridomas can be made by fusing lymphocytes from a mouse immunized with an
immunogenic preparation of the present invention with an immortalized mouse cell line.
Suitable immortal cell lines are mouse myeloma cell lines that are sensitive to culture
medium containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a
number of myeloma cell lines can be used as a fusion partner according to standard
techniques, e.g., the P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or Sp2/O-Ag14 myeloma lines.
These myeloma lines are available from ATCC. Typically, HAT-sensitive mouse
myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG").
Hybridoma cells resulting from the fusion are then selected using HAT medium, which
kills unfused and unproductively fused myeloma cells (unfused splenocytes die after
several days because they are not transformed). Hybridoma cells producing a
monoclonal antibody of the invention are detected by screening the hybridoma culture
supernatants for antibodies that bind hVR-1, hVR-2, and rVR-2, e.g., using a standard
ELISA assay.

Alternative to preparing monoclonal antibody-secreting hybridomas, a
monoclonal anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody can be identified and
isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an
antibody phage display library) with hVR-1, hVR-2, and rVR-2 to thereby isolate
immunoglobulin library members that bind hVR-1, hVR-2, and rVR-2. Kits for
generating and screening phage display libraries are commercially available (e.g., the
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the
Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). Additionally, examples
of methods and reagents particularly amenable for use in generating and screening
antibody display libraries can be found in, for example, Ladner et al. U.S. Patent No.
5,223,409; Kang et al. PCT International Publication No. WO 92/18619; Dower et al.
PCT International Publication No. WO 91/17271; Winter et al. PCT International
Publication WO 92/20791; Markland et al. PCT International Publication No. WO
92/15679; Breitling et al. PCT International Publication WO 93/01288; McCafferty et
al. PCT International Publication No. WO 92/01047; Garrard et al. PCT International
Publication No. WO 92/09690; Ladner et al. PCT International Publication No. WO
90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum.
Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al.
(1993) EMBO J. 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 226:889-896; Clarkson
et al. (1991) Nature 352:624-628; Gram et al. (1992) Proc. Natl. Acad. Sci. USA
89:3576-3580; Garrard et al. (1991) Bio/Technology 9:1373-1377; Hoogenboom et al.
(1991) Nuc. Acid Res. 19:4133-4137; Barbas et al. (1991) Proc. Natl. Acad. Sci. USA
88:7978-7982; and McCafferty et al. Nature (1990) 348:552-554.

Additionally, recombinant anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies,
such as chimeric and humanized monoclonal antibodies, comprising both human and
non-human portions, which can be made using standard recombinant DNA techniques,
are within the scope of the invention. Such chimeric and humanized monoclonal
antibodies can be produced by recombinant DNA techniques known in the art, for
example using methods described in Robinson et al. International Application No.
PCT/US86/02269; Akira, et al. European Patent Application 184,187; Taniguchi, M.,
European Patent Application 171,496; Morrison et al. European Patent Application
173,494; Neuberger et al. PCT International Publication No. WO 86/01533; Cabilly et
al. U.S. Patent No. 4,816,567; Cabilly et al. European Patent Application 125,023;
Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA
84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al. (1987) Proc.
Natl. Acad. Sci. USA 84:214-218; Nishimura et al. (1987) Canc. Res. 47:999-1005;
Wood et al. (1985) Nature 314:446-449; Shaw et al. (1988) J. Natl. Cancer Inst.
80:1553-1559; Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986)
BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature
321:522-525; Verhoeyen et al. (1988) Science 239:1534; and Beidler et al. (1988) J.
Immunol. 141:4053-4060.

An anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody (e.g., monoclonal
antibody) can be used to isolate hVR-1, hVR-2, and rVR-2 by standard techniques, such
as affinity chromatography or immunoprecipitation. An anti-hVR-1, anti-hVR-2, and
anti-rVR-2 antibody can facilitate the purification of natural hVR-1, hVR-2, and rVR-2
from cells and of recombinantly produced hVR-1, hVR-2, and rVR-2 expressed in host
cells. Moreover, an anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody can be used to
detect hVR-1, hVR-2, and rVR-2 protein (e.g., in a cellular lysate or cell supernatant) in
order to evaluate the abundance and pattern of expression of the hVR-1, hVR-2, and
rVR-2 protein. Anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be used
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure,
e.g., to determine the efficacy of a given treatment regimen. Detection can
be facilitated by coupling (i.e., physically linking) the antibody to a detectable
substance. Examples of detectable substances include various enzymes, prosthetic
groups, fluorescent materials, luminescent materials, bioluminescent materials, and
radioactive materials. Examples of suitable enzymes include horseradish peroxidase,
alkaline phosphatase, β-galactosidase, or acetylcholinesterase; examples of suitable
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or
phycoerythrin; an example of a luminescent material is luminol; examples of
bioluminescent materials include luciferase, luciferin, and aequorin; and examples of
suitable radioactive material include ¹²⁵I, ¹³¹I, or ³H.



III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression
vectors, containing a nucleic acid encoding an hVR-1, hVR-2, and rVR-2 protein (or a
portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule
capable of transporting another nucleic acid to which it has been linked. One type of
vector is a "plasmid", which refers to a circular double stranded DNA loop into which
additional DNA segments can be ligated. Another type of vector is a viral vector,
wherein additional DNA segments can be ligated into the viral genome. Certain vectors
are capable of autonomous replication in a host cell into which they are introduced (e.g.,
bacterial vectors having a bacterial origin of replication and episomal mammalian
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the
genome of a host cell upon introduction into the host cell, and thereby are replicated
along with the host genome. Moreover, certain vectors are capable of directing the
expression of genes to which they are operatively linked. Such vectors are referred to
herein as "expression vectors". In general, expression vectors of utility in recombinant
DNA techniques are often in the form of plasmids. In the present specification,
"plasmid" and "vector" can be used interchangeably as the plasmid is the most
commonly used form of vector. However, the invention is intended to include such
other forms of expression vectors, such as viral vectors (e.g., replication defective
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent
functions.

The recombinant expression vectors of the invention comprise a nucleic acid of
the invention in a form suitable for expression of the nucleic acid in a host cell, which
means that the recombinant expression vectors include one or more regulatory
sequences, selected on the basis of the host cells to be used for expression, which are
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant
expression vector, "operably linked" is intended to mean that the nucleotide sequence of
interest is linked to the regulatory sequence(s) in a manner which allows for expression
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a
host cell when the vector is introduced into the host cell). The term "regulatory
sequence" is intended to include promoters, enhancers and other expression control
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for
example, in Goeddel, Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, CA (1990). Regulatory sequences include those which
direct constitutive expression of a nucleotide sequence in many types of host cells and
those which direct expression of the nucleotide sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art
that the design of the expression vector can depend on such factors as the choice of the
host cell to be transformed, the level of expression of protein desired, and the like. The
expression vectors of the invention can be introduced into host cells to thereby produce
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as
described herein (e.g., hVR-1, hVR-2, and rVR-2 proteins, mutant forms of hVR-1,
hVR-2, and rVR-2 proteins, fusion proteins, and the like).

The recombinant expression vectors of the invention can be designed for
expression of hVR-1, hVR-2, and rVR-2 proteins in prokaryotic or eukaryotic cells. For
example, hVR-1, hVR-2, and rVR-2 proteins can be expressed in bacterial cells such as
E. coli, insect cells (using baculovirus expression vectors), yeast cells or mammalian
cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the
recombinant expression vector can be transcribed and translated in vitro, for example
using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in E. coli with
vectors containing constitutive or inducible promoters directing the expression of either
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: 1) to increase expression of recombinant protein;
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion
moiety and the recombinant protein to enable separation of the recombinant protein from
the fusion moiety subsequent to purification of the fusion protein. Enzymes that cleave
at such sites, and their cognate recognition sequences, include Factor Xa, thrombin and
enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc;
Smith, D.B. and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs,
Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione
S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target
recombinant protein.

Purified fusion proteins can be utilized in hVR-1, hVR-2, and rVR-2 activity
assays (e.g., direct assays or competitive assays described in detail below), or to, for
example, generate antibodies specific for hVR-1, hVR-2, and rVR-2 proteins. In one
embodiment, an hVR-1, hVR-2, and rVR-2 fusion protein expressed in a retroviral
expression vector of the present invention can be utilized to infect bone marrow cells
which are subsequently transplanted into irradiated recipients. The pathology of the
subject recipient is then examined after sufficient time has passed (e.g., six (6) weeks).

Examples of suitable inducible non-fusion E. coli expression vectors include
pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego,
California (1990) 60-89). Target gene expression from the pTrc vector relies on host
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene
expression from the pET 11d vector relies on transcription from a T7 gn10-lac fusion
promoter mediated by a coexpressed viral RNA polymerase (T7 gn1). This viral
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident
prophage harboring a T7 gn1 gene under the transcriptional control of the lacUV 5
promoter.

One strategy to maximize recombinant protein expression in E. coli is to express
the protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an
expression vector so that the individual codons for each amino acid are those
preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20:2111-2118).
Such alteration of nucleic acid sequences of the invention can be carried out by standard
DNA synthesis techniques.
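
The codon-replacement strategy described above can be illustrated with a minimal Python sketch. It translates a coding sequence with the standard genetic code and then re-encodes each amino acid with a single host-preferred codon, so that the encoded protein is unchanged. The function names and the PREFERRED_CODON table are assumptions made for illustration only; the table is not taken from this specification or from Wada et al., and in practice it would be replaced with values from a published codon usage table for the chosen host.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
GENETIC_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: one illustrative "preferred" codon per amino acid ("*" = stop).
# Replace with values from a real codon usage table for the intended host.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",
}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(GENETIC_CODE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode_for_host(cds, preferred=PREFERRED_CODON):
    """Re-encode every codon with the host-preferred synonymous codon."""
    return "".join(preferred[aa] for aa in translate(cds))


if __name__ == "__main__":
    original = "ATGCTAAGGTGA"            # Met-Leu-Arg-stop, arbitrary example
    recoded = recode_for_host(original)
    print(recoded)                        # ATGCTGCGTTAA
    # The nucleotide sequence changes; the encoded protein does not.
    assert translate(recoded) == translate(original)
```

Only synonymous substitutions are made, which is why the final check compares translations rather than nucleotide sequences. A fuller treatment might weight several codons per amino acid by frequency instead of always choosing one.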

In another embodiment, the hVR-1, hVR-2, and rVR-2 expression vector is a
yeast expression vector. Examples of vectors for expression in yeast S. cerevisiae include
pYepSec1 (Baldari, et al., (1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz,
(1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:113-123), pYES2
(Invitrogen Corporation, San Diego, CA), and picZ (Invitrogen Corp, San Diego, CA).

Alternatively, hVR-1, hVR-2, and rVR-2 proteins can be expressed in insect
cells using baculovirus expression vectors. Baculovirus vectors available for expression
of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al.
(1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow and Summers (1989)
Virology 170:31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in
mammalian cells using a mammalian expression vector. Examples of mammalian
expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC
(Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements.
For example, commonly used promoters are derived from polyoma, Adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsch, E. F.,
and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
1989.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988)
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916),
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No.
4,873,316 and European Application Publication No. 264,166).
Developmentally-regulated promoters are also encompassed, for example the murine hox promoters
(Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein promoter
(Campes and Tilghman (1989) Genes Dev. 3:537-546).

The expression characteristics of an endogenous hVR-1, hVR-2, and rVR-2 gene
within a cell line or microorganism may be modified by inserting a heterologous DNA
regulatory element into the genome of a stable cell line or cloned microorganism such
that the inserted regulatory element is operatively linked with the endogenous hVR-1,
hVR-2, and rVR-2 gene. For example, an endogenous hVR-1, hVR-2, and rVR-2 gene
which is normally "transcriptionally silent", i.e., an hVR-1, hVR-2, and rVR-2 gene which
is normally not expressed, or is expressed only at very low levels in a cell line or
microorganism, may be activated by inserting a regulatory element which is capable of
promoting the expression of a normally expressed gene product in that cell line or
microorganism. Alternatively, a transcriptionally silent, endogenous hVR-1, hVR-2,
and rVR-2 gene may be activated by insertion of a promiscuous regulatory element that
works across cell types.

A heterologous regulatory element may be inserted into a stable cell line or
cloned microorganism, such that it is operatively linked with an endogenous hVR-1,
hVR-2, and rVR-2 gene, using techniques, such as targeted homologous recombination,
which are well known to those of skill in the art, and described, e.g., in Chappel, U.S.
Patent No. 5,272,071; PCT publication No. WO 91/06667, published May 16, 1991.

The invention further provides a recombinant expression vector comprising a
DNA molecule of the invention cloned into the expression vector in an antisense
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in
a manner which allows for expression (by transcription of the DNA molecule) of an
RNA molecule which is antisense to hVR-1, hVR-2, and rVR-2 mRNA. Regulatory
sequences operatively linked to a nucleic acid cloned in the antisense orientation can be
chosen which direct the continuous expression of the antisense RNA molecule in a
variety of cell types, for instance viral promoters and/or enhancers, or regulatory
sequences can be chosen which direct constitutive, tissue specific or cell type specific
expression of antisense RNA. The antisense expression vector can be in the form of a
recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are
produced under the control of a high efficiency regulatory region, the activity of which
can be determined by the cell type into which the vector is introduced. For a discussion
of the regulation of gene expression using antisense genes see Weintraub, H. et al.,
Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics,
Vol. 1(1) 1986.
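
The relationship between a cDNA cloned in the antisense orientation and its transcript can be sketched in Python as follows. Cloning the insert in reverse places the reverse complement of the sense strand downstream of the regulatory sequence, so the transcript is complementary to the mRNA. The function names and the six-nucleotide example are hypothetical and are not drawn from any sequence of this specification.

```python
# Map each DNA base to its Watson-Crick partner.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
RNA_PARTNER = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(dna):
    """Return the reverse complement of a DNA strand, written 5' to 3'."""
    return dna.upper().translate(DNA_COMPLEMENT)[::-1]


def transcribe(coding_strand):
    """Write the RNA that has the same sequence as the given coding strand."""
    return coding_strand.upper().replace("T", "U")


def antisense_rna(sense_cdna):
    """RNA produced when the cDNA is cloned in the antisense orientation.

    In the reversed orientation the reverse complement of the sense strand
    acts as the coding strand, so the transcript is antisense to the mRNA.
    """
    return transcribe(reverse_complement(sense_cdna))


def is_antisense_to(mrna, candidate):
    """Check that candidate base-pairs with mrna along its whole length."""
    if len(mrna) != len(candidate):
        return False
    # The strands pair antiparallel, so read the candidate from its 3' end.
    return all(RNA_PARTNER[a] == b for a, b in zip(mrna, reversed(candidate)))


if __name__ == "__main__":
    sense_cdna = "ATGGCC"                 # arbitrary illustrative fragment
    mrna = transcribe(sense_cdna)         # AUGGCC
    anti = antisense_rna(sense_cdna)      # GGCCAU
    print(mrna, anti, is_antisense_to(mrna, anti))
```

The sketch models sequence orientation only; promoter choice and expression level, which the paragraph above also addresses, are outside its scope.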

Another aspect of the invention pertains to host cells into which an hVR-1,
hVR-2, and rVR-2 nucleic acid molecule of the invention is introduced, e.g., an hVR-1,
hVR-2, and rVR-2 nucleic acid molecule within a recombinant expression vector or an
hVR-1, hVR-2, and rVR-2 nucleic acid molecule containing sequences which allow it to
homologously recombine into a specific site of the host cell's genome. The terms "host
cell" and "recombinant host cell" are used interchangeably herein. It is understood that
such terms refer not only to the particular subject cell but to the progeny or potential
progeny of such a cell. Because certain modifications may occur in succeeding
generations due to either mutation or environmental influences, such progeny may not,
in fact, be identical to the parent cell, but are still included within the scope of the term
as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, an hVR-1,
hVR-2, and rVR-2 protein can be expressed in bacterial cells such as E. coli, insect cells,
yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells).
Other suitable host cells are known to those skilled in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, lipofection, or electroporation. Suitable methods for transforming or
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A
Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may
integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Selectable
markers include those which confer resistance to drugs, such as G418, hygromycin and
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host
cell on the same vector as that encoding an hVR-1, hVR-2, and rVR-2 protein or can be
introduced on a separate vector. Cells stably transfected with the introduced nucleic
acid can be identified by drug selection (e.g., cells that have incorporated the selectable
marker gene will survive, while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) an hVR-1, hVR-2, and rVR-2 protein.
Accordingly, the invention further provides methods for producing an hVR-1, hVR-2,
and rVR-2 protein using the host cells of the invention. In one embodiment, the method
comprises culturing the host cell of the invention (into which a recombinant expression
vector encoding an hVR-1, hVR-2, and rVR-2 protein has been introduced) in a suitable
medium such that an hVR-1, hVR-2, and rVR-2 protein is produced. In another
embodiment, the method further comprises isolating an hVR-1, hVR-2, and rVR-2
protein from the medium or the host cell.

The host cells of the invention can also be used to produce non-human transgenic
animals. For example, in one embodiment, a host cell of the invention is a fertilized
oocyte or an embryonic stem cell into which hVR-1, hVR-2, and rVR-2-coding
sequences have been introduced. Such host cells can then be used to create non-human
transgenic animals in which exogenous hVR-1, hVR-2, and rVR-2 sequences have been
introduced into their genome or homologous recombinant animals in which endogenous
hVR-1, hVR-2, and rVR-2 sequences have been altered. Such animals are useful for
studying the function and/or activity of an hVR-1, hVR-2, and rVR-2 and for identifying
and/or evaluating modulators of hVR-1, hVR-2, and rVR-2 activity. As used herein, a
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a
transgene. Other examples of transgenic animals include non-human primates, sheep,
dogs, cows, goats, chickens, amphibians, and the like. A transgene is exogenous DNA
which is integrated into the genome of a cell from which a transgenic animal develops
and which remains in the genome of the mature animal, thereby directing the expression
of an encoded gene product in one or more cell types or tissues of the transgenic animal.
As used herein, a "homologous recombinant animal" is a non-human animal, preferably
a mammal, more preferably a mouse, in which an endogenous hVR-1, hVR-2, and
rVR-2 gene has been altered by homologous recombination between the endogenous gene
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an
embryonic cell of the animal, prior to development of the animal.

A transgenic animal of the invention can be created by introducing an hVR-1,
hVR-2, and rVR-2-encoding nucleic acid into the male pronuclei of a fertilized oocyte,
e.g., by microinjection or retroviral infection, and allowing the oocyte to develop in a
pseudopregnant female foster animal. The hVR-1, hVR-2, and rVR-2 cDNA sequence
of SEQ ID NO:1, 3, 5, 7 or 9 can be introduced as a transgene into the genome of a
non-human animal. Alternatively, a nonhuman homologue of a hVR-2 gene, such as a
mouse or rat hVR-2, e.g., the rVR-2 gene, can be used as a transgene. Alternatively, an
hVR-1, hVR-2, and rVR-2 gene homologue, such as another member of the
Capsaicin/Vanilloid family, can be isolated based on hybridization to the hVR-1, hVR-2,
and rVR-2 cDNA sequences of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 (described further
in subsection I above) and used as a transgene. Intronic sequences and polyadenylation
signals can also be included in the transgene to increase the efficiency of expression of
the transgene. A tissue-specific regulatory sequence(s) can be operably linked to an
hVR-1, hVR-2, and rVR-2 transgene to direct expression of an hVR-1, hVR-2, and
rVR-2 protein to particular cells. Methods for generating transgenic animals via embryo
manipulation and microinjection, particularly animals such as mice, have become
conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and
4,870,009, both by Leder et al., U.S. Patent No. 4,873,191 by Wagner et al. and in
Hogan, B., Manipulating the Mouse Embryo (Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, N.Y., 1986). Similar methods are used for production of other
transgenic animals. A transgenic founder animal can be identified based upon the
presence of an hVR-1, hVR-2, and rVR-2 transgene in its genome and/or expression of
hVR-1, hVR-2, and rVR-2 mRNA in tissues or cells of the animals. A transgenic
founder animal can then be used to breed additional animals carrying the transgene.
Moreover, transgenic animals carrying a transgene encoding an hVR-1, hVR-2, and
rVR-2 protein can further be bred to other transgenic animals carrying other transgenes.

To create a homologous recombinant animal, a vector is prepared which contains
at least a portion of an hVR-1, hVR-2, and rVR-2 gene into which a deletion, addition or
substitution has been introduced to thereby alter, e.g., functionally disrupt, the hVR-1,
hVR-2, and rVR-2 gene. The VR-1 or VR-2 gene can be a human gene (e.g., the cDNA
of SEQ ID NO:1, 3, 5, 4, 6, 7, or 9), but more preferably, is a non-human homologue of
a hVR-1 and hVR-2 gene (e.g., the cDNA of SEQ ID NO:10 or 12, or a cDNA isolated
by stringent hybridization with the nucleotide sequence of SEQ ID NO:1, 3, 5, 4, 6, 7,
or 9). For example, a mouse VR-2 gene can be used to construct a homologous
recombination nucleic acid molecule, e.g., a vector, suitable for altering an endogenous
VR-2 gene in the mouse genome. In one embodiment, the homologous recombination
nucleic acid molecule is designed such that, upon homologous recombination, the
endogenous hVR-1, hVR-2, and rVR-2 gene is functionally disrupted (i.e., no longer
encodes a functional protein; also referred to as a "knock out" vector). Alternatively, the
homologous recombination nucleic acid molecule can be designed such that, upon
homologous recombination, the endogenous hVR-1, hVR-2, and rVR-2 gene is mutated
or otherwise altered but still encodes functional protein (e.g., the upstream regulatory
region can be altered to thereby alter the expression of the endogenous hVR-1, hVR-2,
and rVR-2 protein). In the homologous recombination nucleic acid molecule, the altered
portion of the hVR-1, hVR-2, and rVR-2 gene is flanked at its 5' and 3' ends by
additional nucleic acid sequence of the hVR-1, hVR-2, and rVR-2 gene to allow for
homologous recombination to occur between the exogenous hVR-1, hVR-2, and rVR-2
gene carried by the homologous recombination nucleic acid molecule and an
endogenous hVR-1, hVR-2, and rVR-2 gene in a cell, e.g., an embryonic stem cell. The
additional flanking hVR-1, hVR-2, and rVR-2 nucleic acid sequence is of sufficient
length for successful homologous recombination with the endogenous gene. Typically,
15 several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the 
homologous recombination nucleic acid molecule (see, e.g., Thomas, K.R. and 
Capecchi, M. R. (1987) Cell 51:503 for a description of homologous recombination 
vectors). The homologous recombination nucleic acid molecule is introduced into a cell, 
e.g., an embryonic stem cell line (e.g., by electroporation) and cells in which the
introduced hVR-1, hVR-2, and rVR-2 gene has homologously recombined with the
endogenous hVR-1, hVR-2, and rVR-2 gene are selected (see e.g., Li, E. et al. (1992)
Cell 69:915). The selected cells can then be injected into a blastocyst of an animal (e.g., a
mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and
Embryonic Stem Cells: A Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987)
pp. 113-152). A chimeric embryo can then be implanted into a suitable pseudopregnant
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination
nucleic acid molecules, e.g., vectors, or homologous recombinant animals are described
further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT
International Publication Nos.: WO 90/11354 by Le Mouellec et al.; WO 91/01140 by
Smithies et al.; WO 92/0968 by Zijlstra et al.; and WO 93/04169 by Berns et al.

In another embodiment, transgenic non-human animals can be produced which
contain selected systems which allow for regulated expression of the transgene. One
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc.
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the
transgene, animals containing transgenes encoding both the Cre recombinase and a
selected protein are required. Such animals can be provided through the construction of
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a
transgene encoding a selected protein and the other containing a transgene encoding a
recombinase.

Clones of the non-human transgenic animals described herein can also be
produced according to the methods described in Wilmut, I. et al. (1997) Nature 385:810-813
and PCT International Publication Nos. WO 97/07668 and WO 97/07669. In brief,
a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit
the growth cycle and enter G0 phase. The quiescent cell can then be fused, e.g., through
the use of electrical pulses, to an enucleated oocyte from an animal of the same species
from which the quiescent cell is isolated. The reconstructed oocyte is then cultured such
that it develops to morula or blastocyst and then transferred to a pseudopregnant female
foster animal. The offspring borne of this female foster animal will be a clone of the
animal from which the cell, e.g., the somatic cell, is isolated.

IV. Pharmaceutical Compositions 

The hVR-1, hVR-2, and rVR-2 nucleic acid molecules, fragments of hVR-1,
hVR-2, and rVR-2 proteins, and anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies
(also referred to herein as "active compounds") of the invention can be incorporated into
pharmaceutical compositions suitable for administration. Such compositions typically
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically
acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 
active substances is well known in the art. Except insofar as any conventional media or
agent is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents;
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the
extemporaneous preparation of sterile injectable solutions or dispersion. For
intravenous administration, suitable carriers include physiological saline, bacteriostatic
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage
and must be preserved against the contaminating action of microorganisms such as
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol,
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include
isotonic agents, for example, sugars, polyalcohols such as mannitol and sorbitol, or sodium
chloride in the composition. Prolonged absorption of the injectable compositions can be
brought about by including in the composition an agent which delays absorption, for
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a fragment of an hVR-1, hVR-2, and rVR-2 protein or an anti-hVR-1, 
anti-hVR-2, and anti-rVR-2 antibody) in the required amount in an appropriate solvent
with one or a combination of ingredients enumerated above, as required, followed by
filtered sterilization. Generally, dispersions are prepared by incorporating the active
compound into a sterile vehicle which contains a basic dispersion medium and the
required other ingredients from those enumerated above. In the case of sterile powders
for the preparation of sterile injectable solutions, the methods of preparation are vacuum
drying and freeze-drying, which yield a powder of the active ingredient plus any
additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is
applied orally and swished and expectorated or swallowed. Pharmaceutically
compatible binding agents, and/or adjuvant materials can be included as part of the
composition. The tablets, pills, capsules, troches and the like can contain any of the
following ingredients, or compounds of a similar nature: a binder such as
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or
lactose; a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an
aerosol spray from a pressurized container or dispenser which contains a suitable propellant,
e.g., a gas such as carbon dioxide, or a nebulizer.

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art,
and include, for example, for transmucosal administration, detergents, bile salts, and
fusidic acid derivatives. Transmucosal administration can be accomplished through the
use of nasal sprays or suppositories. For transdermal administration, the active
compounds are formulated into ointments, salves, gels, or creams as generally known in
the art.

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will
protect the compound against rapid elimination from the body, such as a controlled
release formulation, including implants and microencapsulated delivery systems.
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate,
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid.
Methods for preparation of such formulations will be apparent to those skilled in the art.
The materials can also be obtained commercially from Alza Corporation and Nova
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically
acceptable carriers. These can be prepared according to methods known to those skilled
in the art, for example, as described in U.S. Patent No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required
pharmaceutical carrier. The specification for the dosage unit forms of the invention is
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose
therapeutically effective in 50% of the population). The dose ratio between toxic and
therapeutic effects is the therapeutic index and it can be expressed as the ratio
LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While
compounds that exhibit toxic side effects may be used, care should be taken to design a
delivery system that targets such compounds to the site of affected tissue in order to
minimize potential damage to uninfected cells and, thereby, reduce side effects.
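
The therapeutic index described above is a simple ratio, and a minimal sketch of the calculation may make the comparison between compounds concrete. The function name and the numerical values below are hypothetical and are chosen only for illustration; they are not data from this specification.

```python
def therapeutic_index(ld50_mg_per_kg: float, ed50_mg_per_kg: float) -> float:
    """Return the ratio LD50/ED50; both doses must use the same units."""
    if ld50_mg_per_kg <= 0 or ed50_mg_per_kg <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50_mg_per_kg / ed50_mg_per_kg


# Hypothetical compounds: A has the larger index and would be preferred
# under the criterion stated in the text.
index_a = therapeutic_index(ld50_mg_per_kg=200.0, ed50_mg_per_kg=10.0)
index_b = therapeutic_index(ld50_mg_per_kg=200.0, ed50_mg_per_kg=50.0)
preferred = "A" if index_a > index_b else "B"
```

A larger value indicates a wider margin between the effective dose and the lethal dose.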

The data obtained from the cell culture assays and animal studies can be used in
formulating a range of dosage for use in humans. The dosage of such compounds lies
preferably within a range of circulating concentrations that include the ED50 with little
or no toxicity. The dosage may vary within this range depending upon the dosage form
employed and the route of administration utilized. For any compound used in the
method of the invention, the therapeutically effective dose can be estimated initially
from cell culture assays. A dose may be formulated in animal models to achieve a
circulating plasma concentration range that includes the IC50 (i.e., the concentration of
the test compound which achieves a half-maximal inhibition of symptoms) as
determined in cell culture. Such information can be used to more accurately determine
useful doses in humans. Levels in plasma may be measured, for example, by high
performance liquid chromatography.
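
One simple way to estimate an IC50 from cell culture data is to interpolate between the two tested concentrations that bracket 50% inhibition. The sketch below assumes that inhibition rises with concentration and interpolates linearly on a log10 concentration scale; the specification does not prescribe this method, so the function name and the choice of interpolation are illustrative assumptions. A fitted dose-response curve would normally be used where enough data points are available.

```python
import math


def interpolate_ic50(concentrations, percent_inhibition):
    """Estimate IC50 by linear interpolation on log10(concentration).

    concentrations: positive values in ascending order (any consistent unit).
    percent_inhibition: matching inhibition values (0-100), assumed to
    increase with concentration.
    """
    if len(concentrations) != len(percent_inhibition):
        raise ValueError("inputs must have the same length")
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive")
    points = list(zip(concentrations, percent_inhibition))
    # Walk through adjacent pairs and find the pair that brackets 50%.
    for (c_lo, y_lo), (c_hi, y_hi) in zip(points, points[1:]):
        if y_lo <= 50.0 <= y_hi and y_hi != y_lo:
            fraction = (50.0 - y_lo) / (y_hi - y_lo)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_ic50
    raise ValueError("data do not bracket 50% inhibition")
```

The result is expressed in the same unit as the input concentrations.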

As defined herein, a therapeutically effective amount of protein or polypeptide
(i.e., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body
weight, and even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to
7 mg/kg, or 5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain
factors may influence the dosage required to effectively treat a subject, including but not 
limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject 
with a therapeutically effective amount of a protein, polypeptide, or antibody can
include a single treatment or, preferably, can include a series of treatments. In a
preferred example, a subject is treated with antibody, protein, or polypeptide in the range
of between about 0.1 to 20 mg/kg body weight, one time per week for between about 1
to 10 weeks, preferably between 2 to 8 weeks, more preferably between about 3 to 7
weeks, and even more preferably for about 4, 5, or 6 weeks. It will also be appreciated
that the effective dosage of antibody, protein, or polypeptide used for treatment may
increase or decrease over the course of a particular treatment. Changes in dosage may 
result and become apparent from the results of diagnostic assays as described herein. 
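
The weight-based weekly regimen in the preferred example above can be sketched as a small calculation. The function name is hypothetical, and the hard limits in the code simply mirror the "about 0.1 to 20 mg/kg" and "about 1 to 10 weeks" ranges of the preferred example; treating those approximate ranges as strict bounds is a simplifying assumption, and a constant dose is assumed even though the text notes that the dosage may change during treatment.

```python
def weekly_dosing_plan(body_weight_kg, dose_mg_per_kg, weeks):
    """Return the dose in mg for each once-weekly administration."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    # Bounds taken from the preferred example; "about" is treated as exact here.
    if not 0.1 <= dose_mg_per_kg <= 20.0:
        raise ValueError("dose outside the 0.1 to 20 mg/kg range")
    if not 1 <= weeks <= 10:
        raise ValueError("duration outside the 1 to 10 week range")
    dose_mg = body_weight_kg * dose_mg_per_kg
    # One entry per week, all equal under the constant-dose assumption.
    return [dose_mg for _ in range(weeks)]


# Hypothetical subject: 70 kg, 2 mg/kg, once weekly for 4 weeks.
plan = weekly_dosing_plan(body_weight_kg=70.0, dose_mg_per_kg=2.0, weeks=4)
```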

The present invention encompasses agents which modulate expression or 
activity. An agent may, for example, be a small molecule. For example, such small
molecules include, but are not limited to, peptides, peptidomimetics, amino acids, amino
acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs,
organic or inorganic compounds (i.e., including heteroorganic and organometallic
compounds) having a molecular weight less than about 10,000 grams per mole, organic
or inorganic compounds having a molecular weight less than about 5,000 grams per
mole, organic or inorganic compounds having a molecular weight less than about 1,000
grams per mole, organic or inorganic compounds having a molecular weight less than 
about 500 grams per mole, and salts, esters, and other pharmaceutically acceptable forms 
of such compounds. 

It is understood that appropriate doses of small molecule agents depend upon a
number of factors within the ken of the ordinarily skilled physician, veterinarian, or
researcher. The dose(s) of the small molecule will vary, for example, depending upon
the identity, size, and condition of the subject or sample being treated, further depending
upon the route by which the composition is to be administered, if applicable, and the 
effect which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention.

Exemplary doses include milligram or microgram amounts of the small molecule
per kilogram of subject or sample weight (e.g., about 1 microgram per kilogram to about
500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams
per kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram).
It is furthermore understood that appropriate doses of a small molecule depend upon the
potency of the small molecule with respect to the expression or activity to be modulated.
Such appropriate doses may be determined using the assays described herein. 

When one or more of these small molecules is to be administered to an animal 
(e.g., a human) in order to modulate expression or activity of a polypeptide or nucleic 
acid of the invention, a physician, veterinarian, or researcher may, for example,
prescribe a relatively low dose at first, subsequently increasing the dose until an
appropriate response is obtained. In addition, it is understood that the specific dose level
for any particular animal subject will depend upon a variety of factors including the
activity of the specific compound employed, the age, body weight, general health,
gender, and diet of the subject, the time of administration, the route of administration,
the rate of excretion, any drug combination, and the degree of expression or activity to
be modulated. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by
stereotactic injection (see e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057).
The pharmaceutical preparation of the gene therapy vector can include the gene
therapy vector in an acceptable diluent, or can comprise a slow release matrix in which
the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery
vector can be produced intact from recombinant cells, e.g., retroviral vectors, the
pharmaceutical preparation can include one or more cells which produce the gene
delivery system.

The pharmaceutical compositions can be included in a container, pack, or
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention

The nucleic acid molecules, proteins, protein homologues, and antibodies
described herein can be used in one or more of the following methods: a) screening
assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring
clinical trials, and pharmacogenetics); and c) methods of treatment (e.g., therapeutic and
prophylactic). As described herein, an hVR-1, hVR-2, and rVR-2 protein of the
invention has one or more of the following activities: (1) it interacts with a non-hVR-1,
non-hVR-2, and non-rVR-2 protein molecule, e.g., a vanilloid compound such as
capsaicin; (2) it modulates intracellular calcium concentration; (3) it activates an hVR-1,
hVR-2, and rVR-2-dependent signal transduction pathway; and (4) it modulates a pain
signaling mechanism, and, thus, can be used to, for example, (1) modulate the
interaction with a non-hVR-1, non-hVR-2, and non-rVR-2 protein molecule; (2)
modulate intracellular calcium concentration; (3) activate an hVR-1, hVR-2, and
rVR-2-dependent signal transduction pathway; and (4) modulate a pain signaling mechanism.

The isolated nucleic acid molecules of the invention can be used, for example, to 
express hVR-1, hVR-2, and rVR-2 protein (e.g., via a recombinant expression vector in
a host cell in gene therapy applications), to detect hVR-1, hVR-2, and rVR-2 mRNA
(e.g., in a biological sample) or a genetic alteration in an hVR-1, hVR-2, and rVR-2
gene, and to modulate hVR-1, hVR-2, and rVR-2 activity, as described further below.
The hVR-1, hVR-2, and rVR-2 proteins can be used to screen for naturally occurring
hVR-1, hVR-2, and rVR-2 substrates, to screen for drugs or compounds which modulate
hVR-1, hVR-2, and rVR-2 activity, as well as to treat disorders characterized by
insufficient or excessive production of hVR-1, hVR-2, and rVR-2 protein or production
of hVR-1, hVR-2, and rVR-2 protein forms which have decreased or aberrant activity
compared to hVR-1, hVR-2, and rVR-2 wild type protein (e.g., pain disorders).
Moreover, the anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies of the invention can
be used to detect and isolate hVR-1, hVR-2, and rVR-2 proteins, regulate the
bioavailability of hVR-1, hVR-2, and rVR-2 proteins, and modulate hVR-1, hVR-2, and
rVR-2 activity.

A. Screening Assays:

The invention provides a method (also referred to herein as a "screening assay")
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides,
peptidomimetics, small molecules or other drugs) which bind to hVR-1, hVR-2, and
rVR-2 proteins, have a stimulatory or inhibitory effect on, for example, hVR-1, hVR-2,
and rVR-2 expression or hVR-1, hVR-2, and rVR-2 activity, or have a stimulatory or
inhibitory effect on, for example, the expression or activity of an hVR-1, hVR-2, and
rVR-2 substrate.

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of an hVR-1, hVR-2, and rVR-2 protein or polypeptide 
or biologically active portion thereof. In another embodiment, the invention provides
assays for screening candidate or test compounds which bind to or modulate the activity
of an hVR-1, hVR-2, and rVR-2 protein or polypeptide or biologically active portion
thereof. The test compounds of the present invention can be obtained using any of the
numerous approaches in combinatorial library methods known in the art, including:
biological libraries; spatially addressable parallel solid phase or solution phase libraries;
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library
method; and synthetic library methods using affinity chromatography selection. The
biological library approach is limited to peptide libraries, while the other four
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et
al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med.
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem.
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in
Gallop et al. (1994) J. Med. Chem. 37:1233.




Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP
'409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage
(Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406);
(Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol.
222:301-310); (Ladner supra).

In one embodiment, an assay is a cell-based assay in which a cell, e.g., a 
neuronal cell, which expresses an hVR-1, hVR-2, and rVR-2 protein or biologically
active portion thereof is contacted with a test compound and the ability of the test
compound to modulate hVR-1, hVR-2, and rVR-2 activity is determined. Determining
the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 activity can be
accomplished by monitoring, for example, intracellular calcium concentration or
membrane depolarization by, e.g., patch-clamp recordings in whole-cell, inside-out, and
outside-out configurations (as described in, for example, Tominaga M. et al. (1998)
Neuron 21:531-543). Determining the ability of the test compound to modulate hVR-1,
hVR-2, and rVR-2 activity can further be accomplished by monitoring the activity of an
hVR-1, hVR-2, and rVR-2-regulated transcription factor. The cell, for example, can be
of mammalian origin, e.g., a neuronal cell. 

The ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 binding
to a substrate or to bind to hVR-1, hVR-2, and rVR-2 can also be determined.
Determining the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2
binding to a substrate can be accomplished, for example, by coupling the hVR-1,
hVR-2, and rVR-2 substrate with a radioisotope or enzymatic label such that binding of the
hVR-1, hVR-2, and rVR-2 substrate to hVR-1, hVR-2, and rVR-2 can be determined by
detecting the labeled hVR-1, hVR-2, and rVR-2 substrate in a complex. Determining
the ability of the test compound to bind hVR-1, hVR-2, and rVR-2 can be accomplished,
for example, by coupling the compound with a radioisotope or enzymatic label such that
binding of the compound to hVR-1, hVR-2, and rVR-2 can be determined by detecting
the labeled hVR-1, hVR-2, and rVR-2 compound in a complex. For example,
compounds (e.g., hVR-1, hVR-2, and rVR-2 substrates) can be labeled with 125I, 35S,
14C, or 3H, either directly or indirectly, and the radioisotope detected by direct counting
of radioemission or by scintillation counting. Alternatively, compounds can be
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 
appropriate substrate to product. 
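
When a radiolabeled substrate is used as described above, the effect of a test compound on binding is commonly summarized as a percent inhibition of specific binding. The sketch below assumes a conventional design in which nonspecific binding is measured separately (for example, in the presence of excess unlabeled substrate) and subtracted from total counts. This design, the function name, and the count values are illustrative assumptions and are not taken from this specification.

```python
def percent_inhibition(total_cpm, nonspecific_cpm, total_with_compound_cpm):
    """Percent reduction in specific binding caused by a test compound.

    total_cpm: counts for labeled substrate bound without test compound.
    nonspecific_cpm: counts remaining when specific sites are blocked.
    total_with_compound_cpm: counts bound in the presence of test compound.
    """
    specific_control = total_cpm - nonspecific_cpm
    if specific_control <= 0:
        raise ValueError("no specific binding in the control")
    specific_test = total_with_compound_cpm - nonspecific_cpm
    return 100.0 * (1.0 - specific_test / specific_control)


# Hypothetical scintillation counts (counts per minute).
inhibition = percent_inhibition(
    total_cpm=10000.0, nonspecific_cpm=1000.0, total_with_compound_cpm=5500.0
)
```

A negative result would indicate that the test compound increased binding of the labeled substrate rather than reducing it.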

It is also within the scope of this invention to determine the ability of a 
compound (e.g., an hVR-1, hVR-2, and rVR-2 substrate) to interact with hVR-1, hVR-2, 
and rVR-2 without the labeling of any of the interactants. For example, a 
microphysiometer can be used to detect the interaction of a compound with hVR-1, 
hVR-2, and rVR-2 without the labeling of either the compound or the hVR-1, hVR-2,
and rVR-2. McConnell, H. M. et al. (1992) Science 257:1906-1912. As used herein, a
"microphysiometer" (e.g., Cytosensor) is an analytical instrument that measures the rate 
at which a cell acidifies its environment using a light-addressable potentiometric sensor 
(LAPS). Changes in this acidification rate can be used as an indicator of the interaction 
between a compound and hVR-1, hVR-2, and rVR-2. 
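
The acidification rate referred to above can be estimated from a series of pH readings as the slope of pH against time. The following sketch uses an ordinary least-squares fit; it is an illustrative assumption about how such readings might be reduced to a rate, not a description of how any particular instrument computes it, and the function name and readings are hypothetical.

```python
def acidification_rate(times_s, ph_values):
    """Return the rate of pH decrease (pH units per second).

    Fits a least-squares line to pH versus time and returns the negated
    slope, so a falling pH gives a positive acidification rate.
    """
    n = len(times_s)
    if n < 2 or n != len(ph_values):
        raise ValueError("need at least two matched readings")
    mean_t = sum(times_s) / n
    mean_ph = sum(ph_values) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (p - mean_ph) for t, p in zip(times_s, ph_values))
    return -sxy / sxx


# Hypothetical readings before and after adding a test compound.
baseline = acidification_rate([0.0, 10.0, 20.0, 30.0], [7.40, 7.39, 7.38, 7.37])
treated = acidification_rate([0.0, 10.0, 20.0, 30.0], [7.40, 7.38, 7.36, 7.34])
fold_change = treated / baseline
```

Comparing the rate before and after addition of a compound gives the change that the text describes as an indicator of interaction.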

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which an hVR-1, hVR-2, and rVR-2 protein or biologically active portion thereof is
contacted with a test compound and the ability of the test compound to bind to the hVR-1,
hVR-2, and rVR-2 protein or biologically active portion thereof is determined.
Biologically active portions of the hVR-1, hVR-2, and rVR-2 proteins to be used in
assays of the present invention include fragments which participate in interactions with
non-hVR-1, non-hVR-2, and non-rVR-2 molecules, e.g., fragments with high surface
probability scores. Binding of the test compound to the hVR-1, hVR-2, and rVR-2
protein can be determined either directly or indirectly as described above. In one
embodiment, the assay includes contacting the hVR-1, hVR-2, and rVR-2 protein or
biologically active portion thereof with a known compound which binds hVR-1, hVR-2,
and rVR-2 to form an assay mixture, contacting the assay mixture with a test compound,
and determining the ability of the test compound to interact with an hVR-1, hVR-2, and
rVR-2 protein, wherein determining the ability of the test compound to interact with an
hVR-1, hVR-2, and rVR-2 protein comprises determining the ability of the test
compound to preferentially bind to hVR-1, hVR-2, and rVR-2 or biologically active
portion thereof as compared to the known compound.

In another embodiment, the assay is a cell-free assay in which an hVR-1, hVR-2,
and rVR-2 protein or biologically active portion thereof is contacted with a test
compound and the ability of the test compound to modulate (e.g., stimulate or inhibit)
the activity of the hVR-1, hVR-2, and rVR-2 protein or biologically active portion
thereof is determined. Determining the ability of the test compound to modulate the
activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished, for example, by
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1,
hVR-2, and rVR-2 target molecule, e.g., a vanilloid compound such as capsaicin, by one
of the methods described above for determining direct binding. Determining the ability
of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1, hVR-2, and rVR-2 target
molecule can also be accomplished using a technology such as real-time Biomolecular
Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem.
63:2338-2345 and Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705. As used
herein "BIA" is a technology for studying biospecific interactions in real time, without
labeling any of the interactants (e.g., BIAcore). Changes in the optical phenomenon of
surface plasmon resonance (SPR) can be used as an indication of real-time reactions
between biological molecules.
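
Real-time binding data of the kind obtained by SPR are often interpreted with a 1:1 (Langmuir) interaction model, in which the response rises toward an equilibrium level set by the analyte concentration and the rate constants. The sketch below implements that textbook model; the specification does not state which model is to be used, so the model choice, the function names, and the parameter values are assumptions made for illustration.

```python
import math


def equilibrium_response(conc_molar, ka_per_molar_s, kd_per_s, rmax):
    """Equilibrium response for 1:1 binding: Rmax * C / (KD + C)."""
    dissociation_constant = kd_per_s / ka_per_molar_s  # KD in molar
    return rmax * conc_molar / (dissociation_constant + conc_molar)


def association_response(t_s, conc_molar, ka_per_molar_s, kd_per_s, rmax):
    """Response at time t during the association phase, starting from zero."""
    # Observed rate constant for approach to equilibrium.
    k_obs = ka_per_molar_s * conc_molar + kd_per_s
    r_eq = equilibrium_response(conc_molar, ka_per_molar_s, kd_per_s, rmax)
    return r_eq * (1.0 - math.exp(-k_obs * t_s))


# Hypothetical parameters: ka = 1e5 /M/s, kd = 1e-3 /s, so KD = 10 nM.
# At an analyte concentration equal to KD, half of Rmax is reached
# at equilibrium.
r_eq_example = equilibrium_response(1e-8, 1e5, 1e-3, 100.0)
```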

In an alternative embodiment, determining the ability of the test compound to
modulate the activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished by
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to further modulate the
activity of a downstream effector of an hVR-1, hVR-2, and rVR-2 target molecule. For
example, the activity of the effector molecule on an appropriate target can be determined
or the binding of the effector to an appropriate target can be determined as previously
described.

In yet another embodiment, the cell-free assay involves contacting an hVR-1,
hVR-2, and rVR-2 protein or biologically active portion thereof with a known
compound which binds the hVR-1, hVR-2, and rVR-2 protein to form an assay mixture,
contacting the assay mixture with a test compound, and determining the ability of the
test compound to interact with the hVR-1, hVR-2, and rVR-2 protein, wherein
determining the ability of the test compound to interact with the hVR-1, hVR-2, and
rVR-2 protein comprises determining the ability of the hVR-1, hVR-2, and rVR-2
protein to preferentially bind to or modulate the activity of an hVR-1, hVR-2, and rVR-2
target molecule.

The cell-free assays of the present invention are amenable to use of both soluble 
and membrane-bound forms of isolated proteins (e.g., hVR-1, hVR-2, and rVR-2 
proteins or biologically active portions thereof). In the case of cell-free assays in which 
a membrane-bound form of an isolated protein is used, it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of the isolated protein is 
maintained in solution. Examples of such solubilizing agents include non-ionic 
detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, 
octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, 
Thesit®, Isotridecypoly(ethylene glycol ether)n, 
3-[(3-cholamidopropyl)dimethylamminio]-1-propane sulfonate (CHAPS), 
3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1-propane sulfonate (CHAPSO), or 
N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either hVR-1, hVR-2, and rVR-2 or its 
target molecule to facilitate separation of complexed from uncomplexed forms of one or 
both of the proteins, as well as to accommodate automation of the assay. Binding of a 
test compound to an hVR-1, hVR-2, and rVR-2 protein, or interaction of an hVR-1, 
hVR-2, and rVR-2 protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the 
reactants. Examples of such vessels include microtitre plates, test tubes, and 
microcentrifuge tubes. In one embodiment, a fusion protein can be provided which adds a 
domain that allows one or both of the proteins to be bound to a matrix. For example, 
glutathione-S-transferase/hVR-1, hVR-2, and rVR-2 fusion proteins or 
glutathione-S-transferase/target fusion proteins can be adsorbed onto glutathione sepharose 
beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which 
are then combined with the test compound or the test compound and either the non-adsorbed 
target protein or hVR-1, hVR-2, and rVR-2 protein, and the mixture incubated under 
conditions conducive to complex formation (e.g., at physiological conditions for salt and 
pH). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components, the matrix is immobilized in the case of beads, and complex 
formation is determined either directly or indirectly, for example, as described above. 
Alternatively, the complexes can be dissociated from the matrix, and the level of hVR-1, 
hVR-2, and rVR-2 binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either an hVR-1, hVR-2, and rVR-2 
protein or an hVR-1, hVR-2, and rVR-2 target molecule can be immobilized utilizing 
conjugation of biotin and streptavidin. Biotinylated hVR-1, hVR-2, and rVR-2 protein 
or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with hVR-1, hVR-2, and rVR-2 protein or target 
molecules but which do not interfere with binding of the hVR-1, hVR-2, and rVR-2 
protein to its target molecule can be derivatized to the wells of the plate, and unbound 
target or hVR-1, hVR-2, and rVR-2 protein trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 

immunodetection of complexes 

of the candidate compound than in its absence, the candidate compound is identified as a 
stimulator of hVR-1, hVR-2, and rVR-2 mRNA or protein expression. Alternatively, 
when expression of hVR-1, hVR-2, and rVR-2 mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
candidate compound is identified as an inhibitor of hVR-1, hVR-2, and rVR-2 mRNA or 
protein expression. The level of hVR-1, hVR-2, and rVR-2 mRNA or protein 
expression in the cells can be determined by methods described herein for detecting 
hVR-1, hVR-2, and rVR-2 mRNA or protein. 
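
By way of illustration only, the comparison of expression in the presence and absence of a 
candidate compound might be scripted as sketched below. The function name, the replicate 
measurements, the choice of an exact permutation test, and the 0.05 threshold are all 
assumptions made for this sketch; the specification does not prescribe a particular 
statistical test. 

```python
from itertools import combinations
from statistics import mean


def classify_candidate(with_compound, without_compound, alpha=0.05):
    """Label a candidate compound from replicate expression measurements.

    Uses an exact two-sided permutation test on the difference of means.
    This test is an assumed choice; any suitable statistical test could be used.
    """
    observed = mean(with_compound) - mean(without_compound)
    pooled = list(with_compound) + list(without_compound)
    n = len(with_compound)
    indices = range(len(pooled))
    extreme = 0
    total = 0
    # Enumerate every way of assigning n of the pooled values to the "with" group.
    for chosen in combinations(indices, n):
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen]
        diff = mean(group_a) - mean(group_b)
        if abs(diff) >= abs(observed) - 1e-12:
            extreme += 1
        total += 1
    p_value = extreme / total
    if p_value >= alpha:
        return "no significant effect", p_value
    return ("stimulator" if observed > 0 else "inhibitor"), p_value
```

With four replicates per condition there are 70 possible assignments, so the smallest 
attainable p-value is 2/70; with fewer replicates this particular test cannot reach the 
0.05 threshold, which is one reason a different test might be preferred in practice. 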

In yet another aspect of the invention, the hVR-1, hVR-2, and rVR-2 proteins 
can be used as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. 
Biol. Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi 
et al. (1993) Oncogene 8:1693-1696; and Brent WO94/10300), to identify other 
proteins which bind to or interact with hVR-1, hVR-2, and rVR-2 ("hVR-1-binding 
proteins", "hVR-2-binding proteins", and "rVR-2-binding proteins" or "hVR-1-bp", 
"hVR-2-bp", and "rVR-2-bp") and are involved in hVR-1, hVR-2, and rVR-2 activity. 
Such hVR-1, hVR-2, and rVR-2-binding proteins are also likely to be involved in the 
propagation of signals by the hVR-1, hVR-2, and rVR-2 proteins or hVR-1, hVR-2, and 
rVR-2 targets as, for example, downstream elements of an hVR-1, hVR-2, and 
rVR-2-mediated signaling pathway, e.g., a pain signaling pathway. Alternatively, such hVR-1, 
hVR-2, and rVR-2-binding proteins are likely to be hVR-1, hVR-2, and rVR-2 
inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
hVR-1, hVR-2, and rVR-2 protein is fused to a gene encoding the DNA binding domain 
of a known transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, 
from a library of DNA sequences, that encodes an unidentified protein ("prey" or 
"sample") is fused to a gene that codes for the activation domain of the known 
transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, 
forming an hVR-1, hVR-2, and rVR-2-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to 
a transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription 
factor can be isolated and used to obtain the cloned gene which encodes the protein 
which interacts with the hVR-1, hVR-2, and rVR-2 protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., an hVR-1, hVR-2, and rVR-2 modulating 
agent, an antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecule, an hVR-1, hVR-2, 
and rVR-2-specific antibody, or an hVR-1, hVR-2, and rVR-2-binding partner) can be 
used in an animal model to determine the efficacy, toxicity, or side effects of treatment 
with such an agent. Alternatively, an agent identified as described herein can be used in 
an animal model to determine the mechanism of action of such an agent. Furthermore, 
this invention pertains to uses of novel agents identified by the above-described 
screening assays for treatments as described herein. 



B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

1. Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the hVR-1, hVR-2, 
and rVR-2 nucleotide sequences, described herein, can be used to map the location of 
the hVR-1, hVR-2, and rVR-2 genes on a chromosome. The mapping of the hVR-1, 
hVR-2, and rVR-2 sequences to chromosomes is an important first step in correlating 
these sequences with genes associated with disease. 

Briefly, hVR-1, hVR-2, and rVR-2 genes can be mapped to chromosomes by 
preparing PCR primers (preferably 15-25 bp in length) from the hVR-1, hVR-2, and 
rVR-2 nucleotide sequences. Computer analysis of the hVR-1, hVR-2, and rVR-2 
sequences can be used to predict primers that do not span more than one exon in the 
genomic DNA, since primers that span more than one exon would complicate the 
amplification process. These primers can then be used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the hVR-1, hVR-2, and rVR-2 sequences will yield 
an amplified fragment. 
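
The primer selection step described above can be sketched, under stated assumptions, as 
follows. The function name and the exon coordinate list are hypothetical; in practice the 
exon boundaries would themselves come from computer analysis of the sequences, and a real 
primer design would also weigh melting temperature and specificity, which this sketch omits. 

```python
def primers_within_exons(cdna, exon_bounds, length=20):
    """Return (start, primer) pairs whose span lies inside a single exon.

    cdna        -- cDNA sequence as a string
    exon_bounds -- list of (start, end) half-open cDNA coordinates, one per exon
                   (hypothetical input for this sketch)
    length      -- primer length; the text prefers 15-25 bp
    """
    if not 15 <= length <= 25:
        raise ValueError("primer length should be 15-25 bp")
    primers = []
    for exon_start, exon_end in exon_bounds:
        # Only start positions that keep the whole primer inside this exon.
        for start in range(exon_start, exon_end - length + 1):
            primers.append((start, cdna[start:start + length]))
    return primers
```

Candidate primers that would straddle an exon boundary are never generated, so every 
returned primer can anneal to a contiguous stretch of genomic DNA. 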

Somatic cell hybrids are prepared by fusing somatic cells from different 
mammals (e.g., human and mouse cells). As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 
human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes (D'Eustachio P. et al. (1983) Science 220:919-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 
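
The assignment of a sequence to a chromosome from a hybrid panel can be illustrated by the 
sketch below. The panel contents, hybrid names, and function name are invented for 
illustration; the logic simply looks for the human chromosome whose presence or absence in 
each hybrid matches the presence or absence of an amplified fragment. 

```python
def concordant_chromosomes(panel, pcr_positive):
    """Return chromosomes whose presence matches the PCR result in every hybrid.

    panel        -- dict: hybrid name -> set of human chromosomes it retains
    pcr_positive -- dict: hybrid name -> True if an amplified fragment was seen
    """
    all_chromosomes = set().union(*panel.values())
    matches = []
    for chrom in sorted(all_chromosomes):
        # A chromosome is concordant if it is present exactly in the positive hybrids.
        if all((chrom in panel[h]) == pcr_positive[h] for h in panel):
            matches.append(chrom)
    return matches
```

If more than one chromosome is returned, the panel does not discriminate between them and 
further hybrids would be needed. 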

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the hVR-1, hVR-2, and rVR-2 
nucleotide sequences to design oligonucleotide primers, sublocalization can be achieved 
with panels of fragments from specific chromosomes. Other mapping strategies which 
can similarly be used to map an hVR-1, hVR-2, and rVR-2 sequence to its chromosome 
include in situ hybridization (described in Fan, Y. et al. (1990) Proc. Natl. Acad. Sci. 
USA, 87:6223-27), pre-screening with labeled flow-sorted chromosomes, and 
pre-selection by hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in 
one step. Chromosome spreads can be made using cells whose division has been 
blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 
can be identified individually. The FISH technique can be used with a DNA sequence 
as short as 500 or 600 bases. However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1,000 bases, and more preferably 2,000 bases, will 
suffice to get good results in a reasonable amount of time. For a review of this 
technique, see Verma et al., Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the 
chance of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 
Man, available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al. (1987) Nature, 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the hVR-1, hVR-2, and rVR-2 gene can be 
determined. If a mutation is observed in some or all of the affected individuals but not 
in any unaffected individuals, then the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and unaffected individuals generally involves 
first looking for structural alterations in the chromosomes, such as deletions or 
translocations that are visible from chromosome spreads or detectable using PCR based 
on that DNA sequence. Ultimately, complete sequencing of genes from several 
individuals can be performed to confirm the presence of a mutation and to distinguish 
mutations from polymorphisms. 
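
The comparison of affected and unaffected individuals can be expressed as a simple set 
operation, sketched below. The function name and the variant labels used to exercise it are 
hypothetical, and the sketch assumes the sequence differences for each individual have 
already been determined. 

```python
def candidate_causative_variants(affected, unaffected):
    """Return variants seen in at least one affected and in no unaffected individual.

    affected, unaffected -- lists of sets; each set holds the sequence variants
    (any hashable label) found in one individual.
    """
    seen_in_affected = set().union(*affected)
    seen_in_unaffected = set().union(*unaffected)
    # Variants shared with unaffected individuals are more likely polymorphisms.
    return seen_in_affected - seen_in_unaffected
```

Variants that also appear in unaffected individuals are excluded, which mirrors the 
distinction drawn above between mutations and polymorphisms. 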

2. Tissue Typing 

The hVR-1, hVR-2, and rVR-2 sequences of the present invention can also be 
used to identify individuals from minute biological samples. The United States military, 
for example, is considering the use of restriction fragment length polymorphism (RFLP) 
for identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
unique bands for identification. This method does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 
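
The way a single base difference changes a restriction fragment pattern can be shown with 
a small in silico digest. This is a simplified sketch on one strand of a linear sequence; 
the function name and the short test sequences are invented for illustration, and EcoRI 
(recognition site GAATTC, cut after the first G) is used only as a familiar example enzyme. 

```python
def digest_fragment_lengths(dna, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting dna at every occurrence of site.

    site       -- recognition sequence (EcoRI's GAATTC is the default)
    cut_offset -- position within the site where the strand is cut
                  (EcoRI cuts after the first G, i.e. offset 1)
    """
    cuts = []
    pos = dna.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = dna.find(site, pos + 1)
    edges = [0] + cuts + [len(dna)]
    return [b - a for a, b in zip(edges, edges[1:])]
```

Two alleles that differ by one base inside a recognition site give different fragment 
length lists, which is the difference a Southern blot would reveal as distinct bands. 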

Furthermore, the sequences of the present invention can be used to provide an 
alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the hVR-1, hVR-2, and rVR-2 
nucleotide sequences described herein can be used to prepare two PCR primers from the 
5' and 3' ends of the sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
and from tissue. The hVR-1, hVR-2, and rVR-2 nucleotide sequences of the invention 
uniquely represent portions of the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per 500 bases. Each of the sequences described herein 
can, to some degree, be used as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater numbers of polymorphisms occur 
in the noncoding regions, fewer sequences are necessary to differentiate individuals. 
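
As a rough worked illustration of the estimate above, the sketch below computes the expected 
number of allelic differences in a stretch of sequence and lists the positions at which a 
sample differs from a standard. Both function names are hypothetical, the sequences are 
assumed to be already aligned to the same length, and the one-per-500-bases figure is taken 
directly from the text as an average. 

```python
def expected_allelic_differences(num_bases, bases_per_variant=500):
    """Expected number of allelic differences in a stretch of num_bases bases,
    using the estimate of about one variant per 500 bases quoted in the text."""
    return num_bases / bases_per_variant


def differing_positions(sample, standard):
    """Return 0-based positions where two equal-length sequences differ."""
    if len(sample) != len(standard):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i, (a, b) in enumerate(zip(sample, standard)) if a != b]
```

On this estimate a 2,000-base amplified region would be expected to carry about four 
differences between two individuals, so a panel of several such regions is what makes an 
identification effectively unique. 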

If a panel of reagents from hVR-1, hVR-2, and rVR-2 nucleotide sequences 
described herein is used to generate a unique identification database for an individual, 
those same reagents can later be used to identify tissue from that individual. Using the 
unique identification database, positive identification of the individual, living or dead, 
can be made from extremely small tissue samples. 

3. Use of Partial hVR-1, hVR-2, and rVR-2 Sequences in Forensic Biology 

DNA-based identification techniques can also be used in forensic biology. 
Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e., another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Examples of polynucleotide reagents include the hVR-1, hVR-2, 
and rVR-2 nucleotide sequences or portions thereof, e.g., fragments derived from SEQ 
ID NO: 1, 3, 4, 6, 7, 9, 10, or 11 having a length of at least 20 bases, preferably at least 
30 bases. 

The hVR-1, hVR-2, and rVR-2 nucleotide sequences described herein can further 
be used to provide polynucleotide reagents, e.g., labeled or labelable probes which can 
be used in, for example, an in situ hybridization technique, to identify a specific tissue, 
e.g., brain tissue. This can be very useful in cases where a forensic pathologist is 
presented with a tissue of unknown origin. Panels of such hVR-1, hVR-2, and rVR-2 
probes can be used to identify tissue by species and/or by organ type. 

In a similar fashion, these reagents, e.g., hVR-1, hVR-2, and rVR-2 primers or 
probes can be used to screen tissue culture for contamination (i.e. screen for the presence 
of a mixture of different types of cells in a culture). 

C. Predictive Medicine 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 

Accordingly, one aspect of the present invention relates to diagnostic assays for 
determining hVR-1, hVR-2, and rVR-2 protein and/or nucleic acid expression as well as 
hVR-1, hVR-2, and rVR-2 activity, in the context of a biological sample (e.g., blood, 
serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant 
hVR-1, hVR-2, and rVR-2 expression or activity. The invention also provides for prognostic 
(or predictive) assays for determining whether an individual is at risk of developing a 
disorder associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. For example, mutations in an hVR-1, hVR-2, and rVR-2 gene can be assayed 
in a biological sample. Such assays can be used for prognostic or predictive purposes to 
thereby prophylactically treat an individual prior to the onset of a disorder characterized 
by or associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of hVR-1, hVR-2, and rVR-2 in 
clinical trials. 

These and other agents are described in further detail in the following sections. 

1. Diagnostic Assays 

An exemplary method for detecting the presence or absence of hVR-1, hVR-2, 
and rVR-2 protein or nucleic acid in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological sample with a compound or an 
agent capable of detecting hVR-1, hVR-2, and rVR-2 protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes hVR-1, hVR-2, and rVR-2 protein such that the 
presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is detected in the 
biological sample. An agent for detecting hVR-1, hVR-2, and rVR-2 mRNA or genomic 
DNA is a labeled nucleic acid probe capable of hybridizing to hVR-1, hVR-2, and 
rVR-2 mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length 
hVR-1, hVR-2, and rVR-2 nucleic acid, such as the nucleic acid of SEQ ID NO: 1, 3, 4, 
6, 7, 9, 10, or 12, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to hVR-1, hVR-2, and rVR-2 mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting hVR-1, hVR-2, and rVR-2 protein is an antibody capable 
of binding to hVR-1, hVR-2, and rVR-2 protein, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 
intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 
"labeled", with regard to the probe or antibody, is intended to encompass direct labeling 
of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a fluorescently labeled secondary antibody and 
end-labeling of a DNA probe with biotin such that it can be detected with fluorescently 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells 
and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is, the detection method of the invention can be used to detect 
hVR-1, hVR-2, and rVR-2 mRNA, protein, or genomic DNA in a biological sample in 
vitro as well as in vivo. For example, in vitro techniques for detection of hVR-1, hVR-2, 
and rVR-2 mRNA include Northern hybridizations and in situ hybridizations. In vitro 
techniques for detection of hVR-1, hVR-2, and rVR-2 protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of hVR-1, hVR-2, and rVR-2 
genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
detection of hVR-1, hVR-2, and rVR-2 protein include introducing into a subject a 
labeled anti-hVR-1, hVR-2, and rVR-2 antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
test subject. Alternatively, the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test subject. A biological sample can 
be, for example, a serum sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a 
compound or agent capable of detecting hVR-1, hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA, such that the presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or 
genomic DNA is detected in the biological sample, and comparing the presence of 
hVR-1, hVR-2, and rVR-2 protein, mRNA or genomic DNA in the control sample with the 
presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or genomic DNA in the test 
sample. 

The invention also encompasses kits for detecting the presence of hVR-1, 
hVR-2, and rVR-2 in a biological sample. For example, the kit can comprise a labeled 
compound or agent capable of detecting hVR-1, hVR-2, and rVR-2 protein or mRNA in 
a biological sample; means for determining the amount of hVR-1, hVR-2, and rVR-2 in 
the sample; and means for comparing the amount of hVR-1, hVR-2, and rVR-2 in the 
sample with a standard. The compound or agent can be packaged in a suitable container. 
The kit can further comprise instructions for using the kit to detect hVR-1, hVR-2, and 
rVR-2 protein or nucleic acid. 

2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
hVR-1, hVR-2, and rVR-2 expression or activity. As used herein, the term "aberrant" 
includes an hVR-1, hVR-2, and rVR-2 expression or activity which deviates from the 
wild type hVR-1, hVR-2, and rVR-2 expression or activity. Aberrant expression or 
activity includes increased or decreased expression or activity, as well as expression or 
activity which does not follow the wild type developmental pattern of expression or the 
subcellular pattern of expression. For example, aberrant hVR-1, hVR-2, and rVR-2 
expression or activity is intended to include the cases in which a mutation in the hVR-1, 
hVR-2, and rVR-2 gene causes the hVR-1, hVR-2, and rVR-2 gene to be 
under-expressed or over-expressed and situations in which such mutations result in a 
non-functional hVR-1, hVR-2, and rVR-2 protein or a protein which does not function in a 
wild-type fashion, e.g., a protein which does not interact with an hVR-1, hVR-2, and 
rVR-2 ligand or one which interacts with a non-hVR-1, non-hVR-2, and non-rVR-2 
ligand. 

The assays described herein, such as the preceding diagnostic assays or the
following assays, can be utilized to identify a subject having or at risk of developing a
disorder associated with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or
nucleic acid expression, such as a pain disorder. Alternatively, the prognostic assays can
be utilized to identify a subject having or at risk for developing a disorder associated
with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or nucleic acid
expression, such as a pain disorder. Thus, the present invention provides a method for
identifying a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2
expression or activity in which a test sample is obtained from a subject and hVR-1,
hVR-2, and rVR-2 protein or nucleic acid (e.g., mRNA or genomic DNA) is detected,
wherein the presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is diagnostic
for a subject having or at risk of developing a disease or disorder associated with
aberrant hVR-1, hVR-2, and rVR-2 expression or activity. As used herein, a "test
sample" refers to a biological sample obtained from a subject of interest. For example, a
test sample can be a biological fluid (e.g., serum), cell sample, or tissue.

Furthermore, the prognostic assays described herein can be used to determine
whether a subject can be administered an agent (e.g., an agonist, antagonist,
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate)
to treat a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2
expression or activity. For example, such methods can be used to determine whether a
subject can be effectively treated with an agent for a pain disorder. Thus, the present
invention provides methods for determining whether a subject can be effectively treated
with an agent for a disorder associated with aberrant hVR-1, hVR-2, and rVR-2
expression or activity in which a test sample is obtained and hVR-1, hVR-2, and rVR-2
protein or nucleic acid expression or activity is detected (e.g., wherein the abundance of
hVR-1, hVR-2, and rVR-2 protein or nucleic acid expression or activity is diagnostic for
a subject that can be administered the agent to treat a disorder associated with aberrant
hVR-1, hVR-2, and rVR-2 expression or activity).

The methods of the invention can also be used to detect genetic alterations in an
hVR-1, hVR-2, and rVR-2 gene, thereby determining if a subject with the altered gene is
at risk for a disorder characterized by misregulation in hVR-1, hVR-2, and rVR-2
protein activity or nucleic acid expression, such as a neurodegenerative disorder. In
embodiments, the methods include detecting, in a sample of cells from the subject, the
presence or absence of a genetic alteration characterized by at least one of an alteration
affecting the integrity of a gene encoding an hVR-1, hVR-2, and rVR-2 protein, or the
mis-expression of the hVR-1, hVR-2, and rVR-2 gene. For example, such genetic
alterations can be detected by ascertaining the existence of at least one of 1) a deletion of
one or more nucleotides from an hVR-1, hVR-2, and rVR-2 gene; 2) an addition of one
or more nucleotides to an hVR-1, hVR-2, and rVR-2 gene; 3) a substitution of one or
more nucleotides of an hVR-1, hVR-2, and rVR-2 gene; 4) a chromosomal
rearrangement of an hVR-1, hVR-2, and rVR-2 gene; 5) an alteration in the level of a
messenger RNA transcript of an hVR-1, hVR-2, and rVR-2 gene; 6) aberrant
modification of an hVR-1, hVR-2, and rVR-2 gene, such as of the methylation pattern of
the genomic DNA; 7) the presence of a non-wild type splicing pattern of a messenger
RNA transcript of an hVR-1, hVR-2, and rVR-2 gene; 8) a non-wild type level of an
hVR-1, hVR-2, and rVR-2 protein; 9) allelic loss of an hVR-1, hVR-2, and rVR-2 gene;
and 10) inappropriate post-translational modification of an hVR-1, hVR-2, and rVR-2
protein. As described herein, there are a large number of assays known in the art which
can be used for detecting alterations in an hVR-1, hVR-2, and rVR-2 gene. A biological
sample is a tissue or serum sample isolated by conventional means from a subject.
In certain embodiments, detection of the alteration involves the use of a
probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos.
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a
ligation chain reaction (LCR) (see, e.g., Landegran et al. (1988) Science 241:1077-1080;
and Nakazawa et al. (1994) Proc. Natl. Acad. Sci. USA 91:360-364), the latter of which
can be particularly useful for detecting point mutations in the hVR-1, hVR-2, and rVR-2
gene (see Abravaya et al. (1995) Nucleic Acids Res. 23:675-682). This method can
include the steps of collecting a sample of cells from a subject, isolating nucleic acid
(e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid
sample with one or more primers which specifically hybridize to an hVR-1, hVR-2, and
rVR-2 gene under conditions such that hybridization and amplification of the hVR-1,
hVR-2, and rVR-2 gene (if present) occurs, and detecting the presence or absence of an
amplification product, or detecting the size of the amplification product and comparing
the length to a control sample. It is anticipated that PCR and/or LCR may be desirable
to use as a preliminary amplification step in conjunction with any of the techniques used
for detecting mutations described herein.
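
The readout described above (presence or absence of an amplification product, or its
size relative to a control) can be illustrated in silico. The following is a minimal
sketch and not part of the original disclosure: the function names, the exact-match
model of primer annealing, and the toy sequences used with it are assumptions made for
illustration, and no actual hVR-1, hVR-2, or rVR-2 sequence is used.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def amplicon_length(template, forward, reverse):
    """Predicted product length, or None if either primer has no exact match.

    Assumption: a primer "anneals" only where it matches the template exactly.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start < 0:
        return None
    # The reverse primer binds the top strand, so its reverse complement
    # appears in the template downstream of the forward primer.
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end < 0:
        return None
    return end + len(site) - start


def compare_to_control(sample, control, forward, reverse):
    """Summarize the sample product relative to the control product."""
    sample_len = amplicon_length(sample, forward, reverse)
    control_len = amplicon_length(control, forward, reverse)
    if control_len is None:
        return "control failed"
    if sample_len is None:
        return "no product"
    if sample_len == control_len:
        return "same size"
    return f"size shift of {sample_len - control_len} bp"
```

A size shift in this model corresponds to an insertion or deletion between the primer
sites; a point substitution would leave the product length unchanged and would need one
of the other techniques described herein.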

Alternative amplification methods include: self sustained sequence replication
(Guatelli, J.C. et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional
amplification system (Kwoh, D.Y. et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173-1177),
Q-Beta Replicase (Lizardi, P.M. et al. (1988) Bio-Technology 6:1197), or any
other nucleic acid amplification method, followed by the detection of the amplified
molecules using techniques well known to those of skill in the art. These detection
schemes are especially useful for the detection of nucleic acid molecules if such
molecules are present in very low numbers.

In an alternative embodiment, mutations in an hVR-1, hVR-2, and rVR-2 gene
from a sample cell can be identified by alterations in restriction enzyme cleavage
patterns. For example, sample and control DNA is isolated, amplified (optionally),
digested with one or more restriction endonucleases, and fragment length sizes are
determined by gel electrophoresis and compared. Differences in fragment length sizes
between sample and control DNA indicate mutations in the sample DNA. Moreover,
sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531)
can be used to score for the presence of specific mutations by development or loss of a
ribozyme cleavage site.
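
The fragment-length comparison described above can be sketched as a simulated digest.
This is an illustration under stated assumptions rather than a description of any
particular laboratory protocol: the DNA is treated as linear, the enzyme cuts at every
exact occurrence of its recognition site, and the default site is EcoRI (GAATTC, cut
after the first G). The function names are hypothetical.

```python
def digest(seq, site="GAATTC", cut_offset=1):
    """Return sorted fragment lengths after cutting a linear sequence.

    cut_offset is the number of bases into the recognition site at which the
    top strand is cut (1 for EcoRI, which cuts G^AATTC).
    """
    seq, site = seq.upper(), site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Fragment lengths are the distances between consecutive cut positions.
    return sorted(b - a for a, b in zip(bounds, bounds[1:]) if b > a)


def patterns_differ(sample, control, site="GAATTC", cut_offset=1):
    """True if the sample and control give different fragment-length patterns."""
    return digest(sample, site, cut_offset) != digest(control, site, cut_offset)
```

A substitution that destroys or creates a recognition site changes the list of fragment
lengths, which is the difference a gel would reveal; a substitution outside every site
leaves the pattern unchanged and would go undetected by this approach.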

In other embodiments, genetic mutations in hVR-1, hVR-2, and rVR-2 can be
identified by hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high
density arrays containing hundreds or thousands of oligonucleotide probes (Cronin,
M.T. et al. (1996) Human Mutation 7:244-255; Kozal, M.J. et al. (1996) Nature
Medicine 2:753-759). For example, genetic mutations in hVR-1, hVR-2, and rVR-2
can be identified in two dimensional arrays containing light-generated DNA probes as
described in Cronin, M.T. et al. supra. Briefly, a first hybridization array of probes can
be used to scan through long stretches of DNA in a sample and control to identify base
changes between the sequences by making linear arrays of sequential overlapping
probes. This step allows the identification of point mutations. This step is followed by
a second hybridization array that allows the characterization of specific mutations by
using smaller, specialized probe arrays complementary to all variants or mutations
detected. Each mutation array is composed of parallel probe sets, one complementary to
the wild-type gene and the other complementary to the mutant gene.
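
The two-stage array logic described above can be sketched in a few lines. The sketch
below is illustrative only and makes simplifying assumptions that are not in the
original text: hybridization is modeled as a perfect substring match, probes step one
base at a time, and the function names and default probe length are hypothetical.

```python
def tile_probes(reference, length=9):
    """Yield (start, probe) for every overlapping window of the reference."""
    for start in range(len(reference) - length + 1):
        yield start, reference[start:start + length]


def scan(reference, sample, length=9):
    """First array: start positions of wild-type probes that fail to hybridize."""
    return [start for start, probe in tile_probes(reference, length)
            if probe not in sample]


def call_base(reference, sample, position, flank=4):
    """Second array: test all four bases at one position, return those that match."""
    left = reference[position - flank:position]
    right = reference[position + 1:position + 1 + flank]
    return [base for base in "ACGT" if left + base + right in sample]
```

For a single substitution well inside the sequence, the failing probes in the first
stage are exactly those that overlap the changed base, so the last failing start
position marks the base itself; the second stage then reports which base is present
there.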

In yet another embodiment, any of a variety of sequencing reactions known in
the art can be used to directly sequence the hVR-1, hVR-2, and rVR-2 gene and detect
mutations by comparing the sequence of the sample hVR-1, hVR-2, and rVR-2 with the
corresponding wild-type (control) sequence. Examples of sequencing reactions include
those based on techniques developed by Maxam and Gilbert ((1977) Proc. Natl. Acad.
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also
contemplated that any of a variety of automated sequencing procedures can be utilized
when performing the diagnostic assays ((1995) Biotechniques 19:448), including
sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO
94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and Griffin et al. (1993)
Appl. Biochem. Biotechnol. 38:147-159).
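
Once a sample sequence has been obtained, the comparison with the wild-type (control)
sequence can be automated. The following is a minimal sketch using the Python standard
library's difflib.SequenceMatcher; it is a general-purpose text comparison and not a
biological alignment algorithm, so it is offered only as an illustration for short,
closely related sequences. The function name and the output tuple layout are
assumptions.

```python
from difflib import SequenceMatcher


def list_differences(wild_type, sample):
    """Return (kind, wild-type position, wild-type bases, sample bases) tuples.

    kind is one of "replace", "delete", or "insert", as reported by difflib.
    """
    matcher = SequenceMatcher(None, wild_type, sample, autojunk=False)
    return [
        (tag, i1, wild_type[i1:i2], sample[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]
```

An empty list indicates that the sample matches the control; each tuple otherwise
locates one substitution, deletion, or insertion relative to the wild-type coordinates.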
Other methods for detecting mutations in the hVR-1, hVR-2, and rVR-2 gene
include methods in which protection from cleavage agents is used to detect mismatched
bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et al. (1985) Science
230:1242). In general, the art technique of "mismatch cleavage" starts by providing
heteroduplexes formed by hybridizing (labeled) RNA or DNA containing the wild-type
hVR-1, hVR-2, and rVR-2 sequence with potentially mutant RNA or DNA
obtained from a tissue sample. The double-stranded duplexes are treated with an agent
which cleaves single-stranded regions of the duplex, such as those which exist due to
basepair mismatches between the control and sample strands. For instance, RNA/DNA
duplexes can be treated with RNase and DNA/DNA hybrids treated with S1 nuclease to
enzymatically digest the mismatched regions. In other embodiments, either
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium
tetroxide and with piperidine in order to digest mismatched regions. After digestion of
the mismatched regions, the resulting material is then separated by size on denaturing
polyacrylamide gels to determine the site of mutation. See, for example, Cotton et al.
(1988) Proc. Natl. Acad. Sci. USA 85:4397; Saleeba et al. (1992) Methods Enzymol.
217:286-295. In an embodiment, the control DNA or RNA can be labeled for detection.

In still another embodiment, the mismatch cleavage reaction employs one or
more proteins that recognize mismatched base pairs in double-stranded DNA (so called
"DNA mismatch repair" enzymes) in defined systems for detecting and mapping point
mutations in hVR-1, hVR-2, and rVR-2 cDNAs obtained from samples of cells. For
example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine
DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al. (1994)
Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based
on an hVR-1, hVR-2, and rVR-2 sequence, e.g., a wild-type hVR-1, hVR-2, and rVR-2
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be
detected from electrophoresis protocols or the like. See, for example, U.S. Patent No.
5,459,039.

In other embodiments, alterations in electrophoretic mobility will be used to
identify mutations in hVR-1, hVR-2, and rVR-2 genes. For example, single strand
conformation polymorphism (SSCP) may be used to detect differences in electrophoretic
mobility between mutant and wild type nucleic acids (Orita et al. (1989) Proc. Natl.
Acad. Sci. USA 86:2766; see also Cotton (1993) Mutat. Res. 285:125-144; and Hayashi
(1992) Genet. Anal. Tech. Appl. 9:73-79). Single-stranded DNA fragments of sample
and control hVR-1, hVR-2, and rVR-2 nucleic acids will be denatured and allowed to
renature. The secondary structure of single-stranded nucleic acids varies according to
sequence; the resulting alteration in electrophoretic mobility enables the detection of
even a single base change. The DNA fragments may be labeled or detected with labeled
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA),
in which the secondary structure is more sensitive to a change in sequence. In an
embodiment, the subject method utilizes heteroduplex analysis to separate double
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility
(Keen et al. (1991) Trends Genet. 7:5).

In yet another embodiment, the movement of mutant or wild-type fragments in
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When
DGGE is used as the method of analysis, DNA will be modified to ensure that it does not
completely denature, for example by adding a GC clamp of approximately 40 bp of
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is
used in place of a denaturing gradient to identify differences in the mobility of control
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753).

Examples of other techniques for detecting point mutations include, but are not
limited to, selective oligonucleotide hybridization, selective amplification, or selective
primer extension. For example, oligonucleotide primers may be prepared in which the
known mutation is placed centrally and then hybridized to target DNA under conditions
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature
324:163; Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230). Such allele specific
oligonucleotides are hybridized to PCR amplified target DNA or a number of different
mutations when the oligonucleotides are attached to the hybridizing membrane and
hybridized with labeled target DNA.

Alternatively, allele specific amplification technology which depends on selective
PCR amplification may be used in conjunction with the instant invention.
Oligonucleotides used as primers for specific amplification may carry the mutation of
interest in the center of the molecule (so that amplification depends on differential
hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3'
end of one primer where, under appropriate conditions, mismatch can prevent or reduce
polymerase extension (Prossner (1993) Tibtech 11:238). In addition it may be desirable
to introduce a novel restriction site in the region of the mutation to create cleavage-based
detection (Gasparini et al. (1992) Mol. Cell Probes 6:1). It is anticipated that in certain
embodiments amplification may also be performed using Taq ligase for amplification
(Barany (1991) Proc. Natl. Acad. Sci. USA 88:189). In such cases, ligation will occur
only if there is a perfect match at the 3' end of the 5' sequence, making it possible to detect
the presence of a known mutation at a specific site by looking for the presence or absence
of amplification.

The methods described herein may be performed, for example, by utilizing pre-packaged
diagnostic kits comprising at least one probe nucleic acid or antibody reagent
described herein, which may be conveniently used, e.g., in clinical settings to diagnose
patients exhibiting symptoms or family history of a disease or illness involving an hVR-1,
hVR-2, and rVR-2 gene.

Furthermore, any cell type or tissue in which hVR-1, hVR-2, and rVR-2 is 
expressed may be utilized in the prognostic assays described herein. 

3. Monitoring of Effects During Clinical Trials

Monitoring the influence of agents (e.g., drugs) on the expression or activity of
an hVR-1, hVR-2, and rVR-2 protein can be applied not only in basic drug screening,
but also in clinical trials. For example, the effectiveness of an agent determined by a
screening assay as described herein to increase hVR-1, hVR-2, and rVR-2 gene
expression, protein levels, or upregulate hVR-1, hVR-2, and rVR-2 activity, can be
monitored in clinical trials of subjects exhibiting decreased hVR-1, hVR-2, and rVR-2
gene expression, protein levels, or downregulated hVR-1, hVR-2, and rVR-2 activity.
Alternatively, the effectiveness of an agent determined by a screening assay to decrease
hVR-1, hVR-2, and rVR-2 gene expression, protein levels, or downregulate hVR-1,
hVR-2, and rVR-2 activity, can be monitored in clinical trials of subjects exhibiting
increased hVR-1, hVR-2, and rVR-2 gene expression, protein levels, or upregulated
hVR-1, hVR-2, and rVR-2 activity. In such clinical trials, the expression or activity of
an hVR-1, hVR-2, and rVR-2 gene, and preferably, other genes that have been
implicated in, for example, an hVR-1, hVR-2, and rVR-2-associated disorder can be
used as a "read out" or markers of the phenotype of a particular cell.

For example, and not by way of limitation, genes, including hVR-1, hVR-2, and
rVR-2, that are modulated in cells by treatment with an agent (e.g., compound, drug or
small molecule) which modulates hVR-1, hVR-2, and rVR-2 activity (e.g., identified in
a screening assay as described herein) can be identified. Thus, to study the effect of
agents on hVR-1, hVR-2, and rVR-2-associated disorders (e.g., pain disorders), for
example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the
levels of expression of hVR-1, hVR-2, and rVR-2 and other genes implicated in the
hVR-1, hVR-2, and rVR-2-associated disorder, respectively. The levels of gene
expression (e.g., a gene expression pattern) can be quantified by northern blot analysis
or RT-PCR, as described herein, or alternatively by measuring the amount of protein
produced, by one of the methods as described herein, or by measuring the levels of
activity of hVR-1, hVR-2, and rVR-2 or other genes. In this way, the gene expression
pattern can serve as a marker, indicative of the physiological response of the cells to the
agent. Accordingly, this response state may be determined before, and at various points
during treatment of the individual with the agent.

In an embodiment, the present invention provides a method for monitoring the
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist,
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate
identified by the screening assays described herein) including the steps of (i) obtaining a
pre-administration sample from a subject prior to administration of the agent; (ii)
detecting the level of expression of an hVR-1, hVR-2, and rVR-2 protein, mRNA, or
genomic DNA in the pre-administration sample; (iii) obtaining one or more
post-administration samples from the subject; (iv) detecting the level of expression or activity
of the hVR-1, hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the
post-administration samples; (v) comparing the level of expression or activity of the hVR-1,
hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the pre-administration sample
with the hVR-1, hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the
post-administration sample or samples; and (vi) altering the administration of the agent to the
subject accordingly. For example, increased administration of the agent may be
desirable to increase the expression or activity of hVR-1, hVR-2, and rVR-2 to higher
levels than detected, i.e., to increase the effectiveness of the agent. Alternatively,
decreased administration of the agent may be desirable to decrease expression or activity
of hVR-1, hVR-2, and rVR-2 to lower levels than detected, i.e., to decrease the
effectiveness of the agent. According to such an embodiment, hVR-1, hVR-2, and rVR-2
expression or activity may be used as an indicator of the effectiveness of an agent,
even in the absence of an observable phenotypic response.
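
The comparison in step (v) above can be sketched numerically. The following is a
hypothetical illustration only: the function names, the use of a simple fold change of
the mean post-administration level over the pre-administration level, and the 10%
tolerance are assumptions that do not appear in the original text, and the sketch is
not a clinical decision rule.

```python
from statistics import mean


def fold_change(pre_level, post_levels):
    """Mean post-administration level divided by the pre-administration level."""
    if pre_level <= 0:
        raise ValueError("pre-administration level must be positive")
    return mean(post_levels) / pre_level


def response_summary(pre_level, post_levels, goal="increase", tolerance=0.10):
    """Report whether expression moved in the intended direction.

    goal is "increase" or "decrease"; changes within the tolerance band
    around 1.0 are treated as no change (an assumed threshold).
    """
    change = fold_change(pre_level, post_levels)
    moved_up = change > 1 + tolerance
    moved_down = change < 1 - tolerance
    if (goal == "increase" and moved_up) or (goal == "decrease" and moved_down):
        return "response in desired direction"
    return "no response in desired direction"
```

The summary is the kind of input that step (vi) would draw on when the administration
of the agent is reconsidered; the measured quantities could equally be protein, mRNA,
or activity levels.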

D. Methods of Treatment

The present invention provides for both prophylactic and therapeutic methods of
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated
with aberrant hVR-1, hVR-2, and rVR-2 expression or activity. With regards to both
prophylactic and therapeutic methods of treatment, such treatments may be specifically
tailored or modified, based on knowledge obtained from the field of pharmacogenomics.
"Pharmacogenomics", as used herein, refers to the application of genomics technologies
such as gene sequencing, statistical genetics, and gene expression analysis to drugs in
clinical development and on the market. More specifically, the term refers to the study of
how a patient's genes determine his or her response to a drug (e.g., a patient's "drug
response phenotype", or "drug response genotype"). Thus, another aspect of the
invention provides methods for tailoring an individual's prophylactic or therapeutic
treatment with either the hVR-1, hVR-2, and rVR-2 molecules of the present invention
or hVR-1, hVR-2, and rVR-2 modulators according to that individual's drug response
genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or
therapeutic treatments to patients who will most benefit from the treatment and to avoid
treatment of patients who will experience toxic drug-related side effects.

WO 00/29577 PCT/US99/26701

1. Prophylactic Methods 
In one aspect, the invention provides a method for preventing, in a subject, a
disease or condition associated with an aberrant hVR-1, hVR-2, and rVR-2 expression
or activity, by administering to the subject an hVR-1, hVR-2, and rVR-2 or an agent
which modulates hVR-1, hVR-2, and rVR-2 expression or at least one hVR-1, hVR-2,
and rVR-2 activity. Subjects at risk for a disease which is caused or contributed to by
aberrant hVR-1, hVR-2, and rVR-2 expression or activity can be identified by, for
example, any or a combination of diagnostic or prognostic assays as described herein.
Administration of a prophylactic agent can occur prior to the manifestation of symptoms
characteristic of the hVR-1, hVR-2, and rVR-2 aberrancy, such that a disease or disorder
is prevented or, alternatively, delayed in its progression. Depending on the type of hVR-1,
hVR-2, and rVR-2 aberrancy, for example, an hVR-1, hVR-2, and rVR-2, hVR-1,
hVR-2, and rVR-2 agonist or hVR-1, hVR-2, and rVR-2 antagonist agent can be used
for treating the subject. The appropriate agent can be determined based on screening
assays described herein.

2. Therapeutic Methods

Another aspect of the invention pertains to methods of modulating hVR-1, hVR-2,
and rVR-2 expression or activity for therapeutic purposes. Accordingly, in an
exemplary embodiment, the modulatory method of the invention involves contacting a
cell with an hVR-1, hVR-2, and rVR-2 or agent that modulates one or more of the
activities of hVR-1, hVR-2, and rVR-2 protein activity associated with the cell. An
agent that modulates hVR-1, hVR-2, and rVR-2 protein activity can be an agent as
described herein, such as a nucleic acid or a protein, a naturally-occurring target
molecule of an hVR-1, hVR-2, and rVR-2 protein (e.g., an hVR-1, hVR-2, and rVR-2
substrate), an hVR-1, hVR-2, and rVR-2 antibody, an hVR-1, hVR-2, and rVR-2 agonist
or antagonist, a peptidomimetic of an hVR-1, hVR-2, and rVR-2 agonist or antagonist,
or other small molecule. In one embodiment, the agent stimulates one or more hVR-1,
hVR-2, and rVR-2 activities. Examples of such stimulatory agents include active hVR-1,
hVR-2, and rVR-2 protein and a nucleic acid molecule encoding hVR-1, hVR-2, and
rVR-2 that has been introduced into the cell. In another embodiment, the agent inhibits
one or more hVR-1, hVR-2, and rVR-2 activities. Examples of such inhibitory agents
include antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecules, anti-hVR-1, hVR-2,
and rVR-2 antibodies, and hVR-1, hVR-2, and rVR-2 inhibitors. These modulatory
methods can be performed in vitro (e.g., by culturing the cell with the agent) or,
alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present
invention provides methods of treating an individual afflicted with a disease or disorder
characterized by aberrant expression or activity of an hVR-1, hVR-2, and rVR-2 protein
or nucleic acid molecule. In one embodiment, the method involves administering an
agent (e.g., an agent identified by a screening assay described herein), or combination of
agents that modulates (e.g., upregulates or downregulates) hVR-1, hVR-2, and rVR-2
expression or activity. In another embodiment, the method involves administering an
hVR-1, hVR-2, and rVR-2 protein or nucleic acid molecule as therapy to compensate for
reduced or aberrant hVR-1, hVR-2, and rVR-2 expression or activity.

Stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in
which hVR-1, hVR-2, and rVR-2 is abnormally downregulated and/or in which
increased hVR-1, hVR-2, and rVR-2 activity is likely to have a beneficial effect. For
example, stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in
which an hVR-1, hVR-2, and rVR-2 is downregulated and/or in which increased hVR-1,
hVR-2, and rVR-2 activity is likely to have a beneficial effect. Likewise, inhibition of
hVR-1, hVR-2, and rVR-2 activity is desirable in situations in which hVR-1, hVR-2,
and rVR-2 is abnormally upregulated and/or in which decreased hVR-1, hVR-2, and
rVR-2 activity is likely to have a beneficial effect.

3. Pharmacogenomics 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention, as well as
agents, or modulators which have a stimulatory or inhibitory effect on hVR-1, hVR-2,
and rVR-2 activity (e.g., hVR-1, hVR-2, and rVR-2 gene expression) as identified by a
screening assay described herein can be administered to individuals to treat
(prophylactically or therapeutically) hVR-1, hVR-2, and rVR-2-associated disorders
(e.g., pain disorders) associated with aberrant hVR-1, hVR-2, and rVR-2 activity. In
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship
between an individual's genotype and that individual's response to a foreign compound
or drug) may be considered. Differences in metabolism of therapeutics can lead to
severe toxicity or therapeutic failure by altering the relation between dose and blood
concentration of the pharmacologically active drug. Thus, a physician or clinician may
consider applying knowledge obtained in relevant pharmacogenomics studies in
determining whether to administer an hVR-1, hVR-2, and rVR-2 molecule or hVR-1,
hVR-2, and rVR-2 modulator as well as tailoring the dosage and/or therapeutic regimen
of treatment with an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2
modulator.

Pharmacogenomics deals with clinically significant hereditary variations in the
response to drugs due to altered drug disposition and abnormal action in affected
persons. See, for example, Eichelbaum, M. et al. (1996) Clin. Exp. Pharmacol. Physiol.
23(10-11):983-985 and Linder, M.W. et al. (1997) Clin. Chem. 43(2):254-266. In
general, two types of pharmacogenetic conditions can be differentiated: genetic
conditions transmitted as a single factor altering the way drugs act on the body (altered
drug action) and genetic conditions transmitted as single factors altering the way the body
acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur
either as rare genetic defects or as naturally-occurring polymorphisms. For example,
glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited
enzymopathy in which the main clinical complication is haemolysis after ingestion of
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of
fava beans.

One pharmacogenomics approach to identifying genes that predict drug
response, known as "a genome-wide association", relies primarily on a high-resolution
map of the human genome consisting of already known gene-related markers (e.g., a
"bi-allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable
sites on the human genome, each of which has two variants). Such a high-resolution
genetic map can be compared to a map of the genome of each of a statistically
significant number of patients taking part in a Phase II/III drug trial to identify markers
associated with a particular observed drug response or side effect. Alternatively, such a
high resolution map can be generated from a combination of some ten-million known
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of
DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may
be involved in a disease process; however, the vast majority may not be
disease-associated. Given a genetic map based on the occurrence of such SNPs, individuals can
be grouped into genetic categories depending on a particular pattern of SNPs in their
individual genome. In such a manner, treatment regimens can be tailored to groups of
genetically similar individuals, taking into account traits that may be common among
such genetically similar individuals.
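
The grouping of individuals by SNP pattern described above can be sketched as a simple
keyed grouping. The example below is illustrative only; the function name, the marker
identifiers, and the genotype encoding (strings such as "A/G") are assumptions and do
not come from the original text.

```python
from collections import defaultdict


def group_by_genotype(genotypes, markers):
    """Group individuals that share the same calls at the chosen markers.

    genotypes maps an individual's identifier to a dict of marker -> call.
    Returns a dict mapping each observed pattern (a tuple of calls, in the
    order given by markers) to the list of individuals carrying it.
    """
    groups = defaultdict(list)
    for person, calls in genotypes.items():
        pattern = tuple(calls[marker] for marker in markers)
        groups[pattern].append(person)
    return dict(groups)
```

Each resulting group is a set of genetically similar individuals with respect to the
chosen markers, which is the unit to which a treatment regimen would be tailored in the
approach described above.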

Alternatively, a method termed the "candidate gene approach" can be utilized to
identify genes that predict drug response. According to this method, if a gene that
encodes a drug's target is known (e.g., an hVR-1, hVR-2, and rVR-2 protein of the
present invention), all common variants of that gene can be fairly easily identified in the
population and it can be determined if having one version of the gene versus another is
associated with a particular drug response.

As an illustrative embodiment, the activity of drug metabolizing enzymes is a
major determinant of both the intensity and duration of drug action. The discovery of
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation
as to why some patients do not obtain the expected drug effects or show exaggerated
drug response and serious toxicity after taking the standard and safe dose of a drug.
These polymorphisms are expressed in two phenotypes in the population, the extensive
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among
different populations. For example, the gene coding for CYP2D6 is highly polymorphic
and several mutations have been identified in PM, which all lead to the absence of
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently
experience exaggerated drug response and side effects when they receive standard doses.
If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as
demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed
metabolite morphine. At the other extreme are the so-called ultra-rapid metabolizers who
do not respond to standard doses. Recently, the molecular basis of ultra-rapid
metabolism has been identified to be due to CYP2D6 gene amplification.
Alternatively, a method termed "gene expression profiling" can be utilized to
identify genes that predict drug response. For example, the gene expression of an
animal dosed with a drug (e.g., an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR-2,
and rVR-2 modulator of the present invention) can give an indication whether gene
pathways related to toxicity have been turned on.

Information generated from more than one of the above pharmacogenomics
approaches can be used to determine appropriate dosage and treatment regimens for
prophylactic or therapeutic treatment of an individual. This knowledge, when applied to
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus
enhance therapeutic or prophylactic efficiency when treating a subject with an hVR-1,
hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 modulator, such as a
modulator identified by one of the exemplary screening assays described herein.

This invention is further illustrated by the following examples, which should not
be construed as limiting. The contents of all references, patents and published patent
applications cited throughout this application, as well as the Figures and the Sequence
Listing, are incorporated herein by reference.



EXAMPLES 



EXAMPLE 1: IDENTIFICATION AND CHARACTERIZATION
OF hVR-1, hVR-2, AND rVR-2 cDNA

In this example, the identification and characterization of the genes encoding 
hVR-1 (clone Fchrb87a6), hVR-2 (clone flh21el 1), hVR-2 alternate form (clone 
fi1iobl2c4), and rVR-2 (clone flrxbl47gll) are described. 



Isolation of the hVR-1, hVR-2, and rVR-2 cDNA

The invention is based, at least in part, on the discovery of two human genes and
one rat gene encoding novel members of the Capsaicin/Vanilloid receptor family,
referred to herein as hVR-1, hVR-2, and rVR-2, respectively. These clones were
identified from a human heart library and a rat dorsal root ganglion (DRG) library, based
on sequence homology to the known rat VR-1 (Accession Number AF029310). The
sequences of the two human clones and the rat clone were determined and found to
contain open reading frames.

The nucleotide sequence of the full length hVR-1 cDNA and the predicted amino
acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID NOs:1
and 2, respectively.

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4
and 5, respectively.

The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the
predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in
Figure 3 and in SEQ ID NOs:7 and 8, respectively.

The amino acid sequence of the predicted full length human VR-2 protein
(alternate form) is shown in Figure 16 and in SEQ ID NO:20.

The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino
acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs:10
and 11, respectively.

Analysis of the hVR-1, hVR-2, and rVR-2 Molecules

The hVR-1 protein (SEQ ID NO:2) was aligned with the human VR-2 protein
(SEQ ID NO:5) using the GAP program in the GCG software package (Blosum 62
matrix) and a gap weight of 12 and a length weight of 4. The results showed a 46.348%
identity and 55.378% similarity between the two sequences (see Figure 5).

The hVR-1 nucleotide sequence (SEQ ID NO:1) was aligned with the human
VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in the GCG software
package (nwsgapdna matrix) and a gap weight of 50 and a length weight of 3. The
results showed a 55.316% identity and 55.316% similarity between the two sequences
(see Figure 6).

The hVR-2 protein (SEQ ID NO:5) was aligned with the rat VR-2 protein (SEQ
ID NO:11) using the CLUSTAL W (1.74) multiple sequence alignment program (Figure
7), as well as using the GAP program in the GCG software package (Blosum 62 matrix)
and a gap weight of 12 and a length weight of 4. The results showed a 79.167% identity
and 81.703% similarity between the two sequences (see Figure 8).

The hVR-1 nucleotide sequence (SEQ ID NO:1) was aligned with the rat VR-1
nucleotide sequence (Accession Number: AF029310) using the GAP program in the
GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length weight
of 3. The results showed an 82.125% identity and 82.125% similarity between the two
sequences (see Figure 9).

The hVR-1 protein (SEQ ID NO:2) was aligned with the rat VR-1 protein
(Accession Number: AF029310) using the GAP program in the GCG software package
(Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. The results showed
an 86.022% identity and 89.247% similarity between the two sequences (see Figure 10).

The hVR-2 protein (SEQ ID NO:5) was aligned with the human VR-2 protein
(alternate form) (SEQ ID NO:8) using the CLUSTAL W (1.74) multiple sequence
alignment program (Figure 11).

Finally, the hVR-2 protein (SEQ ID NO:5) was aligned with the predicted full
length human VR-2 protein (alternate form) (SEQ ID NO:20) using the CLUSTAL W
(1.74) multiple sequence alignment program (Figure 17).
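
The identity and similarity percentages reported above are properties of a gapped pairwise alignment. The following is a minimal sketch of how such percentages can be computed from two already-aligned sequences. It is not the GCG GAP implementation: the function name, the toy sequences, the use of ungapped columns as the denominator, and the simplified residue groups (a stand-in for a matrix-based similarity criterion such as Blosum 62) are all assumptions made for illustration.

```python
# Hypothetical conservative-substitution groups. A real program would
# decide "similar" from a scoring matrix (e.g. Blosum 62), not from sets.
SIMILAR_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("NQ"), set("ST"), set("AG"),
]


def percent_identity_similarity(aligned_a, aligned_b, gap="-"):
    """Return (% identity, % similarity) over the ungapped alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # columns containing a gap are not counted
        compared += 1
        if x == y:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(x in group and y in group for group in SIMILAR_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


# Toy alignment (not taken from the sequence listing).
identity, similarity = percent_identity_similarity("MKKWS-STDL", "MRKWSASSDL")
print(f"identity {identity:.3f}%, similarity {similarity:.3f}%")
```

Because identical residues are also counted as similar, the similarity figure is never lower than the identity figure, which matches the pattern of the values quoted in this Example.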

A search was performed against the HMM database resulting in the identification
of three ankyrin repeat domains in the amino acid sequence of hVR-1 (SEQ ID NO:2) at
about residues 201-233, 248-283, and 333-361, and in the amino acid sequence of hVR-2
(SEQ ID NO:5) at about residues 162-194, 208-243, and 293-328. The results of the
searches are set forth in Figures 13 and 15, respectively.

Hydropathy plots have identified 6 transmembrane domains in the hVR-1 and
the hVR-2 proteins (see Figures 12 and 14, respectively).
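
The text does not state which hydropathy scale or window size was used for these plots. As an illustrative sketch only, the code below assumes the widely used Kyte-Doolittle scale, a 19-residue sliding window, and a cutoff of 1.6 for candidate membrane-spanning segments; the function names and the toy sequence are hypothetical.

```python
# Kyte-Doolittle hydropathy values per amino acid (assumed scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=19):
    """Mean hydropathy of every full window along the sequence."""
    scores = [KYTE_DOOLITTLE[aa] for aa in seq.upper()]
    if window < 1 or window > len(scores):
        raise ValueError("window must be between 1 and the sequence length")
    running = sum(scores[:window])
    profile = [running / window]
    for i in range(window, len(scores)):
        running += scores[i] - scores[i - window]  # slide the window by one
        profile.append(running / window)
    return profile


def hydrophobic_segments(profile, window=19, threshold=1.6):
    """Merge windows above the threshold into 1-based (start, end) ranges."""
    segments = []
    for i, value in enumerate(profile):
        if value > threshold:
            start, end = i + 1, i + window
            if segments and start <= segments[-1][1] + 1:
                segments[-1] = (segments[-1][0], end)
            else:
                segments.append((start, end))
    return segments


# Toy sequence: a 19-residue leucine stretch flanked by charged residues.
toy = "DDDDDKKKKK" + "L" * 19 + "KKKKKDDDDD"
profile = hydropathy_profile(toy)
segments = hydrophobic_segments(profile)
print(segments)
```

The merged ranges are deliberately crude: they cover every residue of every window above the cutoff, so they extend past the hydrophobic core into the flanking residues.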
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A series of searches have revealed that the hVR-1 protein matches the ProDom 
entry 141801 for the vanilloid receptor subtype and the ProDom entry 145518 for the 
vanilloid receptor subtype. 

Moreover, a search was performed against the Prosite database resulting in the
identification of four N-glycosylation sites in the amino acid sequence of SEQ ID NO:5
(at about residues 171-174, 192-195, 604-607, and 749-752), three cGMP-dependent
protein kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at
about residues 2-5, 368-371, and 499-502), a series of protein kinase C and casein
kinase II phosphorylation sites in the amino acid sequence of SEQ ID NO:5, two
tyrosine kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at
about residues 368-375 and 622-628), and two myristoylation sites in the amino acid
sequence of SEQ ID NO:5 (at about residues 169-174 and 765-770).
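
Site searches of this kind are pattern matches against the protein sequence. As a hedged illustration, the sketch below scans for the Prosite N-glycosylation consensus N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline), which yields four-residue ranges like those listed above. The function name and the toy sequence are hypothetical, and the sketch is not the Prosite scanning software itself.

```python
import re

# N-{P}-[ST]-{P} written as a regular expression. The lookahead lets
# overlapping occurrences of the motif be reported as separate hits.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")


def find_n_glycosylation_sites(seq):
    """Return (start, end, motif) tuples using 1-based residue numbering."""
    return [
        (m.start() + 1, m.start() + 4, m.group(1))
        for m in N_GLYCOSYLATION.finditer(seq.upper())
    ]


# Toy sequence (not SEQ ID NO:5); "NPTL" is skipped because of the proline.
sites = find_n_glycosylation_sites("MANGSAKNPTLNNTSQ")
print(sites)
```

A match only marks a potential site; whether a given asparagine is actually glycosylated cannot be determined from the sequence pattern alone.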

Tissue Distribution of hVR-1 and hVR-2 mRNA

This Example describes the tissue distribution of hVR-1 and hVR-2 mRNA as
determined by in situ hybridization.

For in situ analysis, tissues, such as brain regions and whole brain, obtained from
human and monkey were first frozen on dry ice. Ten-micrometer-thick coronal sections
of the tissues were postfixed with 4% formaldehyde in DEPC-treated 1X phosphate-buffered
saline at room temperature for 10 minutes before being rinsed twice in DEPC
1X phosphate-buffered saline and once in 0.1 M triethanolamine-HCl (pH 8.0).
Following incubation in 0.25% acetic anhydride-0.1 M triethanolamine-HCl for 10
minutes, sections were rinsed in DEPC 2X SSC (1X SSC is 0.15 M NaCl plus 0.015 M
sodium citrate). Tissue was then dehydrated through a series of ethanol washes,
incubated in 100% chloroform for 5 minutes, and then rinsed in 100% ethanol for 1
minute and 95% ethanol for 1 minute and allowed to air dry.

Hybridizations were performed with 35S-radiolabeled (5 X 10^ cpm/ml) cRNA
probes. Probes were incubated in the presence of a solution containing 600 mM NaCl,
10 mM Tris (pH 7.5), 1 mM EDTA, 0.01% sheared salmon sperm DNA, 0.01% yeast
tRNA, 0.05% yeast total RNA type XI, 1X Denhardt's solution, 50% formamide, 10%
dextran sulfate, 100 mM dithiothreitol, 0.1% sodium dodecyl sulfate (SDS), and 0.1%
sodium thiosulfate for 18 hours at 55°C.

After hybridization, slides were washed with 2X SSC. Sections were then
sequentially incubated at 37°C in TNE (a solution containing 10 mM Tris-HCl (pH 7.6),
500 mM NaCl, and 1 mM EDTA) for 10 minutes, in TNE with 10 µg of RNase A per ml
for 30 minutes, and finally in TNE for 10 minutes. Slides were then rinsed with 2X
SSC at room temperature, washed with 2X SSC at 50°C for 1 hour, washed with 0.2X
SSC at 55°C for 1 hour, and washed with 0.2X SSC at 60°C for 1 hour. Sections were then
dehydrated rapidly through serial ethanol-0.3 M sodium acetate concentrations before
being air dried and exposed to Kodak Biomax MR scientific imaging film for 24 hours
and subsequently dipped in NB-2 photoemulsion and exposed at 4°C for 7 days before
being developed and counterstained.
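
The wash series above becomes more stringent in two ways at once: the salt concentration falls and the temperature rises. As a small worked example, the sketch below converts each SSC dilution into salt concentrations using the definition given earlier in this Example (1X SSC is 0.15 M NaCl plus 0.015 M sodium citrate). The variable and function names are hypothetical; the wash conditions are those stated in the text.

```python
# 1X SSC composition from the text, expressed in millimolar.
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_millimolar(fold):
    """Salt concentrations (mM) of an SSC solution at the given fold strength."""
    return {name: round(conc * fold, 3) for name, conc in SSC_1X_MM.items()}


# (fold SSC, temperature in degrees C, minutes) for the three timed washes.
WASHES = [(2.0, 50, 60), (0.2, 55, 60), (0.2, 60, 60)]

for fold, temp_c, minutes in WASHES:
    salts = ssc_millimolar(fold)
    print(f"{fold}X SSC at {temp_c} C for {minutes} min: "
          f"{salts['NaCl']} mM NaCl, {salts['sodium citrate']} mM citrate")
```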

The data indicate that the hVR-1 molecule is not expressed in either human or
monkey brain. The hVR-1 molecule is expressed in nodose and trigeminal sensory neurons,
but is not expressed in sympathetic neurons. Within the nodose sensory neurons and
trigeminal sensory neurons, expression was seen in distinct sub-populations. Moreover,
hVR-1 is expressed in some, but not all, small dorsal root ganglion (DRG) neurons and in
a few medium sized DRG neurons. The hVR-1 molecule is partially co-expressed with
the neuropeptide CGRP and with substance P, which are present in nociceptive neurons.

The data further indicate that the VR-2 molecule is expressed in both human and
monkey brain, primarily in cortical neurons. The VR-2 molecule is also expressed in
other brain regions, for example, the thalamus, striatum, hippocampus, hypothalamus,
midbrain, medulla and brain stem. In addition, the VR-2 molecule is expressed in
parasympathetic neurons of the monkey heart (atrium), nodose sensory neurons,
trigeminal (TRG) sensory neurons, dorsal root ganglion sensory neurons, sympathetic
neurons, and motor neurons of the spinal cord. The VR-2 molecule is widely expressed
in TRG and DRG neurons, being present in most small and medium sized neurons and
also in a few of the large neurons. VR-2, like VR-1, partially co-localizes with CGRP
and substance P.

Trigeminal sensory neurons are recognized pain centers while sympathetic
neurons have been shown to be involved in neuropathic pain.



EXAMPLE 2: EXPRESSION OF RECOMBINANT hVR-1, hVR-2, AND
rVR-2 PROTEIN IN BACTERIAL CELLS

In this example, hVR-1, hVR-2, and rVR-2 is expressed as a recombinant
glutathione-S-transferase (GST) fusion polypeptide in E. coli and the fusion polypeptide
is isolated and characterized. Specifically, hVR-1, hVR-2, and rVR-2 is fused to GST
and this fusion polypeptide is expressed in E. coli, e.g., strain PEB199. Expression of
the GST-hVR-1, GST-hVR-2, and GST-rVR-2 fusion protein in PEB199 is induced
with IPTG. The recombinant fusion polypeptide is purified from crude bacterial lysates
of the induced PEB199 strain by affinity chromatography on glutathione beads. Using
polyacrylamide gel electrophoretic analysis of the polypeptide purified from the
bacterial lysates, the molecular weight of the resultant fusion polypeptide is determined.

EXAMPLE 3: EXPRESSION OF RECOMBINANT hVR-1, hVR-2,
AND rVR-2 PROTEIN IN COS CELLS

To express the hVR-1, hVR-2, and rVR-2 gene in COS cells, the pcDNA/Amp
vector from Invitrogen Corporation (San Diego, CA) is used. This vector contains an
SV40 origin of replication, an ampicillin resistance gene, an E. coli replication origin, a
CMV promoter followed by a polylinker region, and an SV40 intron and
polyadenylation site. A DNA fragment encoding the entire hVR-1, hVR-2, and rVR-2
protein and an HA tag (Wilson et al. (1984) Cell 37:767) or a FLAG tag fused in-frame
to the 3' end of the fragment is cloned into the polylinker region of the vector, thereby
placing the expression of the recombinant protein under the control of the CMV
promoter.

To construct the plasmid, the hVR-1, hVR-2, and rVR-2 DNA sequence is
amplified by PCR using two primers. The 5' primer contains the restriction site of
interest followed by approximately twenty nucleotides of the hVR-1, hVR-2, and rVR-2
coding sequence starting from the initiation codon; the 3' end sequence contains
complementary sequences to the other restriction site of interest, a translation stop
codon, the HA tag or FLAG tag and the last 20 nucleotides of the hVR-1, hVR-2, and
rVR-2 coding sequence. The PCR-amplified fragment and the pcDNA/Amp vector are
digested with the appropriate restriction enzymes and the vector is dephosphorylated
using the CIAP enzyme (New England Biolabs, Beverly, MA). Preferably the two
restriction sites chosen are different so that the hVR-1, hVR-2, and rVR-2 gene is
inserted in the correct orientation. The ligation mixture is transformed into E. coli cells
(strains HB101, DH5α, SURE, available from Stratagene Cloning Systems, La Jolla,
CA, can be used), the transformed culture is plated on ampicillin media plates, and
resistant colonies are selected. Plasmid DNA is isolated from transformants and
examined by restriction analysis for the presence of the correct fragment.
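
The primer layout described above can be sketched in code. This is an illustration under stated assumptions, not the primers actually used: the EcoRI and XhoI sites, the toy coding sequence, the particular nucleotide encoding of the HA epitope (YPYDVPDYA), the TAA stop codon, and all function and variable names are hypothetical choices, and no extra flanking bases are added outside the restriction sites.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of an upper- or lower-case DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


HA_TAG = "TACCCATACGATGTTCCAGATTACGCT"  # one encoding of YPYDVPDYA (assumed)
STOP_CODON = "TAA"                      # assumed choice of stop codon
SITE_5PRIME = "GAATTC"                  # EcoRI, hypothetical choice
SITE_3PRIME = "CTCGAG"                  # XhoI, hypothetical choice
STOPS = ("TAA", "TAG", "TGA")


def design_primers(cds, anneal_len=20):
    """Build a forward and a reverse primer for C-terminal tagging of a CDS."""
    cds = cds.upper()
    if cds[-3:] in STOPS:
        cds = cds[:-3]  # drop the native stop so the tag stays in frame
    if not cds.startswith("ATG") or len(cds) < anneal_len:
        raise ValueError("expected a coding sequence starting at ATG")
    # 5' primer: restriction site, then the start of the coding sequence.
    forward = SITE_5PRIME + cds[:anneal_len]
    # Sense-strand layout of the 3' end of the amplified fragment:
    # end of coding sequence + tag + stop codon + second restriction site.
    sense_tail = cds[-anneal_len:] + HA_TAG + STOP_CODON + SITE_3PRIME
    # The 3' primer anneals to the sense strand, so it is the reverse complement.
    reverse = reverse_complement(sense_tail)
    return forward, reverse


# Toy coding sequence (not from the sequence listing).
TOY_CDS = "ATGGCTAAAGGTCTGGAAGACCCTGTTTCCAAAGGTTAA"
forward, reverse = design_primers(TOY_CDS)
print("forward:", forward)
print("reverse:", reverse)
```

Using two different restriction sites, as the text recommends, is what fixes the orientation of the insert relative to the CMV promoter.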

COS cells are subsequently transfected with the hVR-1, hVR-2, and rVR-2-pcDNA/Amp
plasmid DNA using the calcium phosphate or calcium chloride co-precipitation
methods, DEAE-dextran-mediated transfection, lipofection, or
electroporation. Other suitable methods for transfecting host cells can be found in
Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual,
2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1989. The expression of the hVR-1, hVR-2, and rVR-2 polypeptide
is detected by radiolabelling (35S-methionine or 35S-cysteine available from NEN,
Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, D.
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY, 1988) using an HA specific monoclonal antibody. Briefly, the cells are
labelled for 8 hours with 35S-methionine (or 35S-cysteine). The culture media are then
collected and the cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP-40,
0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culture
media are precipitated with an HA specific monoclonal antibody. Precipitated
polypeptides are then analyzed by SDS-PAGE.

Alternatively, DNA containing the hVR-1, hVR-2, and rVR-2 coding sequence
is cloned directly into the polylinker of the pcDNA/Amp vector using the appropriate
restriction sites. The resulting plasmid is transfected into COS cells in the manner
described above, and the expression of the hVR-1, hVR-2, and rVR-2 polypeptide is
detected by radiolabelling and immunoprecipitation using an hVR-1, hVR-2, and rVR-2
specific monoclonal antibody.

EXAMPLE 4: ELECTROPHYSIOLOGICAL STUDIES OF VR2

Human VR2 was functionally characterized in both HEK293 cells and Xenopus
oocytes using electrophysiological methods. VR2 (in the pcDNA3.1 vector purchased
from Invitrogen) was transiently expressed in HEK293 cells (ATCC) and recordings were
performed 48 hours after transfection of cells using the whole-cell patch-clamp method
(described in Bertil Hille, Ionic Channels of Excitable Membranes, 1992; Hamill et al.
(1981) Pflügers Arch. 391:85-100). The results indicate that heat stimulation (>50°C)
induces a rapidly inactivating inward current (1-2 nA). Heat-evoked currents of VR2
displayed profound desensitization and could be reversibly blocked by the VR1 inhibitor
capsazepine (at a 10 concentration). In contrast to rat VR1, capsaicin (at a 1-10 µM
concentration), resiniferatoxin (at a 0.1-3 µM concentration), and low pH (5.0-6.0) do
not induce any currents from VR2. Binding studies of [3H]-resiniferatoxin (NEN) to
both human VR1 and VR2 in membranes isolated from HEK293 cell homogenates also
indicate that resiniferatoxin (at a 0.1-10 nM concentration) has no specific binding to
VR2 while it binds to human VR1 with high affinity.

For the oocyte studies, human VR2 was subcloned into an oocyte expression
vector containing 5'- and 3'-UTR of Xenopus β-globin (Chiara et al. (1999)
Biochemistry 38(20):6689-6698). In vitro transcription was carried out as described in
Chiara et al. (supra) and cRNA (10-100 ng) was then injected into the oocytes. VR2
function was characterized in the oocytes 48 hours after cRNA injection using a standard
two-electrode voltage-clamp. Consistent with the data from the HEK293 studies, VR2
can only be activated by heat stimulation (48-50°C) but not by vanilloid receptor
agonists, capsaicin, or resiniferatoxin. The vanilloid receptor antagonist capsazepine (at
a 1-10 µM concentration) blocks the heat response of VR2 reversibly.

EXAMPLE 5: GENERATION OF ANTI-hVR-2 ANTIBODIES AND hVR-2
PROTEIN LOCALIZATION BY IMMUNOSTAINING

Polyclonal antisera were raised in rabbits against the following three peptides
derived from the human VR2 amino acid sequence, using the techniques described in Ed
Harlow and David Lane (1988) "Antibodies: A Laboratory Manual" Cold Spring Harbor
Laboratory Press.

Antibody PEPTIDE 1: AFHCKSPHRHRMVVLE (SEQ ID NO:13)
Antibody PEPTIDE 2: RPEAPTGPNATESVQPMEGQEDEGN (SEQ ID NO:14)
Antibody PEPTIDE 3: SVLEMENGYWWCRKKQRAG (SEQ ID NO:15)

Antisera were subsequently affinity purified using the peptide immunogen.

The polyclonal antisera were tested for immunostaining of both monkey and rat
dorsal root ganglion sensory neurons. Peptides 1 and 3 gave specific staining of
subpopulations of sensory neurons that was competed with the corresponding peptide.
This pattern of expression was very similar to the one observed using a VR-2 riboprobe.

EXAMPLE 6: CHROMOSOMAL LOCALIZATION OF hVR-1 AND
hVR-2

To chromosomally map the hVR-1 gene, primers were designed based on the
sequence of hVR-1 (clone Fchrb87a6) (amplifying a 177 bp product from a human
control cell line DNA and multiple faint larger products from a control hamster cell line
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the
Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL).

The hVR-1 primers used in the PCR mapping studies were: forward -
TAGGAGACCCCGTTGCCACG (SEQ ID NO:16) and reverse -
GATTCACTTGGGGACAGTGACG (SEQ ID NO:17), and the PCR reactions were
performed as follows: 5 µl Template DNA (10 ng/µl), 1.5 µl 10X Perkin Elmer PCR
Buffer, 1.2 µl Pharmacia dNTP mix 2.5 mM, 1.15 µl Forward primer 6.6 µM, 1.15 µl
Reverse primer 6.6 µM, 5 µl Gibco/BRL Platinum Taq 0.05 U/µl (Hot Start), using an
amplification profile of: 95°C for 10 minutes followed by 35 cycles of 94°C for 40
seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR
products were run on 2% agarose gels, post-stained with SYBR Gold (1:10,000 dilution
in 1X TBE), and scanned on a Molecular Dynamics 595 Fluorimager.
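
As a worked check of the reaction recipe above, the sketch below sums the component volumes and derives the final concentrations. It takes all volumes in microlitres and both primer stocks as 6.6 µM, which is an interpretation of the listed values; the variable names are hypothetical.

```python
# (component, volume in microlitres, stock concentration, stock unit)
COMPONENTS = [
    ("template DNA", 5.0, 10.0, "ng/ul"),
    ("10X PCR buffer", 1.5, 10.0, "X"),
    ("dNTP mix", 1.2, 2.5, "mM"),
    ("forward primer", 1.15, 6.6, "uM"),
    ("reverse primer", 1.15, 6.6, "uM"),
    ("Platinum Taq", 5.0, 0.05, "U/ul"),
]

# Total reaction volume is the sum of all component volumes.
total_ul = sum(volume for _, volume, _, _ in COMPONENTS)

# Final concentration = stock concentration x (component volume / total volume).
final = {name: stock * volume / total_ul for name, volume, stock, _ in COMPONENTS}

# Absolute amounts for the components usually quoted per reaction.
template_ng = 5.0 * 10.0
taq_units = 5.0 * 0.05

print(f"total volume: {total_ul:.2f} ul")
for name, _, _, unit in COMPONENTS:
    print(f"{name}: {final[name]:.3f} {unit} final")
print(f"template: {template_ng:.0f} ng, Taq: {taq_units:.2f} U per reaction")
```

On this reading the components add up to a 15 µl reaction, in which 1.5 µl of 10X buffer gives a 1X final buffer concentration, so the recipe is internally consistent.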

The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are
in order 1-93. A "1" is a positive result, a "-" is a negative result, a "?" is an ambiguous
result.

hVRl 1 - . 1 ?. 1 - 1 - 1 1 - 1 - 1 1 - - 1 -1 11 - 1 1 -1 1 I ... 

1---1 1 1 1-1--1 1 I-1-1---1 1-1 1 

RH linkage analysis was performed using the Map Manager QTb28 software package.
hVR1 was found to map to the p arm of human chromosome 17, 18.9 cR3000
telomeric to the Whitehead Institute framework marker WI-6584, and 7.7 cR3000
centromeric of the Whitehead framework marker WI-5436. LOD scores for linkage
were 14.5 for WI-6584 and 19.3 for WI-5436. This region corresponds to the
cytogenetic location 17p12-13. This region is syntenic to mouse chromosome 11.

To chromosomally map the hVR-2 gene, primers were designed from 5' UTR
sequence of human VR2 (clone Flh21el 1) (amplifying a 166 bp product from a human
control cell line DNA and 2 much larger faint bands from a control hamster cell line
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the
Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL).

The hVR-2 primers used in the PCR mapping studies were: forward -
TTAAGCTCCCGTTCTCACCG (SEQ ID NO:18) and reverse -
GCTGCGGGAGGAAGTGAAGC (SEQ ID NO:19), and the PCR reactions were
performed as follows: 5 µl Template DNA (10 ng/µl), 1.5 µl 10X Perkin Elmer PCR
Buffer, 1.2 µl Pharmacia dNTP mix 2.5 mM, 1.15 µl Forward primer 6.6 µM, 1.15 µl
Reverse primer 6.6 µM, 5 µl Gibco/BRL Platinum Taq 0.05 U/µl (Hot Start), using an
amplification profile of 95°C for 10 minutes, followed by 35 cycles of 94°C for 40
seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR
products were run on 2% agarose gels, post-stained with SYBR Gold (1:10,000 dilution
in 1X TBE), and scanned on a Molecular Dynamics 595 Fluorimager.

The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are
in order 1-93. A "1" is a positive result, a "-" is a negative result, a "?" is an ambiguous
result.

5 hVR2 1 - - 1 1 - ? 1 1 1 - - 1 - 1 1 1 - 1 1 1 1 - - 1 1 1 - 1 11 - - 1 

1 1 1 l-.-l-l 1-1-1 II I l-.-l 1-1-1-?-.-? 

RH linkage analysis was performed using the Map Manager QTb28 software package.
hVR2 was found to map to the p arm of human chromosome 17, 29.3 cR3000
telomeric to the Whitehead Institute framework marker D17S721, and 23.3 cR3000
centromeric of the Whitehead framework marker AFMA043ZB5. LOD scores for
linkage were 11.9 for D17S721 and 13.6 for AFMA043ZB5. This region corresponds to
the cytogenetic location 17p11-12. This region is syntenic to mouse chromosome 11.

Equivalents

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments of the invention
described herein. Such equivalents are intended to be encompassed by the following
claims.

1. An isolated nucleic acid molecule selected from the group consisting of:

(a) a nucleic acid molecule comprising the nucleotide sequence set
forth in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof; and

(b) a nucleic acid molecule consisting of the nucleotide sequence set
forth in SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof.

2. An isolated nucleic acid molecule which encodes a polypeptide selected
from the group consisting of:

(a) a polypeptide comprising the amino acid sequence set forth in
SEQ ID NO:2, 5, 8, or 11; and

(b) a polypeptide consisting of the amino acid sequence set forth in
SEQ ID NO:2, 5, 8, or 11.

3. An isolated nucleic acid molecule which encodes a naturally occurring
allelic variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID
NO:2, 5, 8, or 11.

d) a nucleic acid molecule which encodes a fragment of a
polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5, 8, or 11, wherein
the fragment comprises at least 15 contiguous amino acid residues of the amino acid
sequence of SEQ ID NO:2, 5, 8, or 11.

5. An isolated nucleic acid molecule comprising the nucleic acid molecule
of any one of claims 1, 2, 3, or 4, and a nucleotide sequence encoding a heterologous
polypeptide.

6. A vector comprising the nucleic acid molecule of any one of claims 1, 2,
3, or 4.

7. The vector of claim 6, which is an expression vector.

8. A host cell transfected with the expression vector of claim 7.

9. A method of expressing a polypeptide comprising culturing the host cell
of claim 8 in an appropriate culture medium to, thereby, express the polypeptide.

10. An isolated polypeptide selected from the group consisting of:

a) a fragment of a polypeptide comprising the amino acid sequence
of SEQ ID NO:2, 5, 8, or 11, wherein the fragment comprises at least 15 contiguous
amino acids of SEQ ID NO:2, 5, 8, or 11;

b) a naturally occurring allelic variant of a polypeptide comprising
the amino acid sequence of SEQ ID NO:2, 5, 8, or 11, wherein the polypeptide is
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule
consisting of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions;

c) a polypeptide which is encoded by a nucleic acid molecule
comprising a nucleotide sequence which is at least 83% identical to a nucleic acid
comprising the nucleotide sequence of SEQ ID NO:1, 3, 4, 6, 7, 9, 10, or 12; and

d) a polypeptide comprising an amino acid sequence which is at least
87% identical to the amino acid sequence of SEQ ID NO:2, 5, 8, or 11.

11. The isolated polypeptide of claim 10 comprising the amino acid sequence
of SEQ ID NO:2, 5, 8, or 11.

12. The polypeptide of claim 10, further comprising heterologous amino acid
sequences.

13. An antibody which selectively binds to a polypeptide of claim 10.

14. A method for detecting the presence of a polypeptide of claim 10 in a
sample comprising:

a) contacting the sample with a compound which selectively binds to
the polypeptide; and

b) determining whether the compound binds to the polypeptide in
the sample to thereby detect the presence of a polypeptide of claim 10 in the sample.

15. The method of claim 14, wherein the compound which binds to the
polypeptide is an antibody.

16. A kit comprising a compound which selectively binds to a polypeptide of
claim 10 and instructions for use.

17. A method for detecting the presence of a nucleic acid molecule of any
one of claims 1, 2, 3, or 4 in a sample comprising:

a) contacting the sample with a nucleic acid probe or primer which
selectively hybridizes to the nucleic acid molecule; and

b) determining whether the nucleic acid probe or primer binds to a
nucleic acid molecule in the sample to thereby detect the presence of a nucleic acid
molecule of any one of claims 1, 2, 3, or 4 in the sample.

18. The method of claim 17, wherein the sample comprises mRNA
molecules and is contacted with a nucleic acid probe.

19. A kit comprising a compound which selectively hybridizes to a nucleic
acid molecule of any one of claims 1, 2, 3, or 4 and instructions for use.

20. A method for identifying a compound which binds to a polypeptide of
claim 10 comprising:

a) contacting the polypeptide, or a cell expressing the polypeptide,
with a test compound; and

b) determining whether the polypeptide binds to the test compound.

21. The method of claim 20, wherein the binding of the test compound to the
polypeptide is detected by a method selected from the group consisting of:

a) detection of binding by direct detection of test
compound/polypeptide binding;

b) detection of binding using a competition binding assay; and

c) detection of binding using an assay for hVR-1, hVR-2, or rVR-2
activity.

22. A method for modulating the activity of a polypeptide of claim 10
comprising contacting the polypeptide or a cell expressing the polypeptide with a
compound which binds to the polypeptide in a sufficient concentration to modulate the
activity of the polypeptide.

23. A method for identifying a compound which modulates the activity of a
polypeptide of claim 10 comprising:

a) contacting a polypeptide of claim 10 with a test compound; and

b) determining the effect of the test compound on the activity of the
polypeptide to thereby identify a compound which modulates the activity of the
polypeptide.

24. A method for treating a subject having a disorder characterized by
aberrant hVR-1 or hVR-2 protein activity or nucleic acid expression comprising
administering to the subject a hVR-1 or hVR-2 modulator such that treatment of the
subject occurs.

25. The method of claim 24, wherein the hVR-1 or hVR-2 modulator is a
small molecule.

26. The method of claim 24, wherein the disorder is a pain disorder.
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humanVRl gene with translation of open reading frame 

Input file Fchrbe7a6 .seq; Output File Fchrbe7a£ . tra 
Sequence length 3 90S 

GTGAGCGCAACGCACTGCXSGGaVGTGAGCGCAACGCyiCTGCGGGCJlGT^ 

CGCACTGCXllGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGaiCTGC^ 

GCAGTGAGCGCJU^CGCACTTGCGG<XJVGTGAG0GCAACGCACIX5CGGGC^^ 

GCAACGCACTGCGGGCAGTGAGCGCAACGCACTCCGGGCAGTGAGCGCAACGCACrrOCGGGCAGTC 
GCGGGCy^GTGAGaSCAACGCACTGCGGGCAGTGAGCGOlAOGCACTGC^^ 

AGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTTAATGTGAGTTAGCTCACTOVTTAGGCACCCCAGGCT^ 

TTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCyvCACAGGAAAC^^ 

ACGCCAAGCTCTAATACGACTCACTATAGGGAAAGCTGGTACGCCTGCAGGTACCGGTCCGGAATTC^ 

ACGCXSTCOSAAAACACy^CCTCTCTCCTGTGGGAAGACTC 

<n«CTCT6GGGAGAATTCGTAGATCATCCTCAGAAAAGCCTTGCcir^ 

CCAGAAAAGCTGTCCACAGTAGTCCCCCCTTATCCACXSGGTGTCAC 

GG«rnX:CAATATTAAATGGAAAATTCTTCAAACAGTTCCCAAGTTTO 

AaAGTCTCTCCCGTG<XyVTCTGGGATGCAAACCGTCC<^ 

AfllXSVCXTAGTCGTCAGATCGCCCGTCCTGGTATCACAGTGCT^^ 

MKKWSSTDLGTAADPLQK 18 
GCAAGG ATG AAG AAA TGG AGC AGC ACA GAC TTG GGG ACA GCT GCG GAC CCA CTC CAA AAG 54 

DTCPDPLDGDPNSRPPPAKP 38 
GAC ACC TGC CCA GAC CCC CTG GAT GGA GAC CCT AAC TCC AGG CCA CCT CCA GCC AAG CCC 114 

QLPTAKSRTRLFGKGDSEEA 58 
CAG CTC CCC ACG GCC AAG AGC CGC ACC CGG CTC TTT GGG AAG GGT GAC TCG GAG GAG GCT 174 

FPVDCPHEEGELDSCPTITV 78 
TTC CCG GTG GAT TGC CCC CAC GAG GAA GGT GAG TTG GAC TCC TGC CCG ACC ATC ACA GTC 234 

SPVITIQRPGDGPTGARLLS 98 
AGC CCT GTT ATC ACC ATC CAG AGG CCA GGA GAC GGC CCC ACC GGT GCC AGG CTG CTG TCC 294 

QDSVAASTEKTLRLYDRRSI 118 
CAG GAC TCT GTC GCC GCC AGC ACC GAG AAG ACC CTC AGG CTC TAT GAT CGC AGG AGT ATC 354 

FEAVAQNNCQDLESLLLFLQ 138 
TTT GAA GCC GTT GCT CAG AAT AAC TGC CAG GAT CTG GAG AGC CTG CTG CTC TTC CTG CAG 414 
KSKKHLTDNEFKDPETGKTC 158 
AAG AGC AAG AAG CAC CTC ACA GAC AAC GAG TTC AAA GAC CCT GAG ACA GGG AAG ACC TGT 474 

LLKAMLNLHDGQNTTIPLLL 178 
CTG CTG AAA GCC ATG CTC AAC CTG CAC GAC GGA CAG AAC ACC ACC ATC CCC CTG CTC CTG 534 

EIARQTDSLKELVNASYTDS 198 
GAG ATC GCG CGG CAA ACG GAC AGC CTG AAG GAG CTT GTC AAC GCC AGC TAC ACG GAC AGC 594 

YYKGQTALHIAIERRNMALV 218 
TAC TAC AAG GGC CAG ACA GCA CTG CAC ATC GCC ATC GAG AGA CGC AAC ATG GCC CTG GTG 654 

TLLVENGADVQAAAHGDFFK 238 
ACC CTC CTG GTG GAG AAC GGA GCA GAC GTC CAG GCT GCG GCC CAT GGG GAC TTC TTT AAG 714 

KTKGRPGFYFGELPLSLAAC 258 
AAA ACC AAA GGG CGG CCT GGA TTC TAC TTC GGT GAA CTG CCC CTG TCC CTG GCC GCG TGC 774 

TNQLGIVKFLLQNSWQTADI 278 
ACC AAC CAG CTG GGC ATC GTG AAG TTC CTG CTG CAG AAC TCC TGG CAG ACG GCC GAC ATC 834 

SARDSVGNTVLHALVEVADN 298 
AGC GCC AGG GAC TCG GTG GGC AAC ACG GTG CTG CAC GCC CTG GTG GAG GTG GCC GAC AAC 894 

TADNTKFVTSMYNEILMLGA 318 
ACG GCC GAC AAC ACG AAG TTT GTG ACG AGC ATG TAC AAT GAG ATT CTG ATG CTG GGG GCC 954 

KLHPTLKLEELTNKKGMTPL 338 
AAA CTG CAC CCG ACG CTG AAG CTG GAG GAG CTC ACC AAC AAG AAG GGA ATG ACG CCG CTG 1014 

ALAAGTGKIGVLAYILQREI 358 
GCT CTG GCA GCT GGG ACC GGG AAG ATC GGG GTC TTG GCC TAT ATT CTC CAG CGG GAG ATC 1074 

QEPECRHLSRKFTEWAYGPV 378 
CAG GAG CCC GAG TGC AGG CAC CTG TCC AGG AAG TTC ACC GAG TGG GCC TAC GGG CCC GTG 1134 

HSSLYDLSCIDTCEKNSVLE 398 
CAC TCC TCG CTG TAC GAC CTG TCC TGC ATC GAC ACC TGC GAG AAG AAC TCG GTG CTG GAG 1194 

VIAYSSSETPNRHDMLLVEP 418 
GTG ATC GCC TAC AGC AGC AGC GAG ACC CCT AAT CGC CAC GAC ATG CTC TTG GTG GAG CCG 1254 

LNRLLQDKWDRFVKRIFYFN 438 
CTG AAC CGA CTC CTG CAG GAC AAG TGG GAC AGA TTC GTC AAG CGC ATC TTC TAC TTC AAC 1314 

FLVYCLYMIIFTMAAYYRPV 458 
TTC CTG GTC TAC TGC CTG TAC ATG ATC ATC TTC ACC ATG GCT GCC TAC TAC AGG CCC GTG 1374 

DGLPPFKMEKIGDYFRVTGE 478 
GAT GGC TTG CCT CCC TTT AAG ATG GAA AAA ATT GGA GAC TAT TTC CGA GTT ACT GGA GAG 1434 

ILSVLGGVYFFFRGIQYFLQ 498 
ATC CTC TCT GTG TTA GGA GGA GTC TAC TTC TTT TTC CGA GGG ATT CAG TAT TTC CTG CAG 1494 
RRPSMKTLFVDSYSEMLFFL 518 
AGG CGG CCG TCG ATG AAG ACC CTG TTT GTG GAC AGC TAC AGT GAG ATG CTT TTC TTT CTG 1554 

QSLFMLATVVLYFSHLKEYV 538 
CAG TCA CTG TTC ATG CTG GCC ACC GTG GTG CTG TAC TTC AGC CAC CTC AAG GAG TAT GTG 1614 

ASMVFSLALGWTNMLYYTRG 558 
GCT TCC ATG GTA TTC TCC CTG GCC TTG GGC TGG ACC AAC ATG CTC TAC TAC ACC CGC GGT 1674 

FQQMGIYAVMIEKMILRDLC 578 
TTC CAG CAG ATG GGC ATC TAT GCC GTC ATG ATA GAG AAG ATG ATC CTG AGA GAC CTG TGC 1734 

RFMFVYIVFLFGFSTAVVTL 598 
CGT TTC ATG TTT GTC TAC ATC GTC TTC TTG TTC GGG TTT TCC ACA GCG GTG GTG ACG CTG 1794 

IEDGKNDSLPSESTSHRWRG 618 
ATT GAA GAC GGG AAG AAT GAC TCC CTG CCG TCT GAG TCC ACG TCG CAC AGG TGG CGG GGG 1854 

PACRPPDSSYNSLYSTCLEL 638 
CCT GCC TGC AGG CCC CCC GAT AGC TCC TAC AAC AGC CTG TAC TCC ACC TGC CTG GAG CTG 1914 

FKFTIGMGDLEFTENYDFKA 658 
TTC AAG TTC ACC ATC GGC ATG GGC GAC CTG GAG TTC ACT GAG AAC TAT GAC TTC AAG GCT 1974 

VFIILLLAYVILTYILLLNM 678 
GTC TTC ATC ATC CTG CTG CTG GCC TAT GTA ATT CTC ACC TAC ATC CTC CTG CTC AAC ATG 2034 

LIALMGETVNKIAQESKNIW 698 
CTC ATC GCC CTC ATG GGT GAG ACT GTC AAC AAG ATC GCA CAG GAG AGC AAG AAC ATC TGG 2094 

KLQRAITILDTEKSFLKCMR 718 
AAG CTG CAG AGA GCC ATC ACC ATC CTC GAC ACG GAG AAG AGC TTC CTT AAG TGC ATG AGG 2154 

KAFRSGKLLQVGYTPDGKDD 738 
AAG GCC TTC CGC TCA GGC AAG CTC CTG CAG GTG GGG TAC ACA CCT GAT GGC AAG GAC GAC 2214 

YRWCFRVDEVNWTTWNTNVG 758 
TAC CGG TGG TGC TTC AGG GTC GAC GAG GTC AAC TGG ACC ACC TGG AAC ACC AAC GTG GGC 2274 

IINEDPGNCEGVKRTLSFSL 778 
ATC ATC AAC GAA GAC CCG GGC AAC TGT GAG GGC GTC AAG CGC ACC CTC AGC TTC TCC CTC 2334 

RSSRVSGRHWKNFALVPLLR 798 
CGG TCA AGC AGA GTT TCA GGC AGA CAC TGG AAG AAC TTT GCC CTG GTC CCC CTT TTA AGA 2394 

EASARDRQSAQPEEVYLRQF 818 
GAG GCA AGT GCT CGA GAT AGG CAG TCT GCT CAG CCC GAG GAA GTT TAT CTG CGA CAG TTT 2454 

SGSLKPEDAEVFKSPAASGE 838 
TCA GGG TCT CTG AAG CCA GAG GAC GCT GAG GTC TTC AAG AGT CCT GCC GCT TCC GGG GAG 2514 

K * 840 
AAG TGA 2520 
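
The one-letter translation printed with each codon row follows the standard genetic code. As a minimal illustrative sketch (the function name `translate` and the way the codon table is built are assumptions introduced here, not part of the original listing), the first ten codons of the open reading frame, read from the figure and checked against the aligned peptide, can be translated as follows:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) until the first stop codon.

    Codons containing unrecognised characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the human VR1 open reading frame (hypothetical
    # example input, transcribed from the figure).
    print(translate("ATG AAG AAA TGG AGC AGC ACA GAC TTG GGG"))
    # Final codon followed by the stop codon.
    print(translate("AAG TGA"))
```

Reporting unreadable codons as `X` rather than raising an error is a deliberate choice in this sketch, since scanned listings often contain characters outside A, C, G and T.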
Full-length human VR2 

Input file Flh21ell.seq; Output File Flh21ell.tra 
Sequence length 2809 

qOCTftQCCTOTCCTQ^C3U3Q(XaAatf«TTi^^ 
OOCCTCAOCCTOCGOGOCTCCAGTCAGGCCAACACC^ 

NT6PSSSP 6 

TGCACkGAOGmrrOQCrGQACOC^^ ATG ACXT TCA CCC TCC AGC TCT CCA 24 

VFRLETIiOOOQEDGSEAORQ 2e 

OTT TTC AOG TTO GAG ACA TTA GAT GOA OOC CAA GAA GAT OOC TCI OAG GOG GAC AGA GGA 84 

RLDPGSGLPPNESQFQGEDR 46 

AAG CTO GAT TTT GOG AGC GGO CTO OCT CCC ATG OAG TCA CAG TTC CAG GGC GAG GAC OGG 144 

KFAPQIRVNLNYRKGTGASQ 66 

AAA TTC GCC CCT CAO ATA AGA GTC AAC CTC AAC TAG OGA AAG GGA ACA GOT GCC AGT CAG 204 

PDPHRFDRDRtFNAVSR. GVP 66 

COS GAT CCA AAC OGA TTT GAC OGA GAT OGG CTC TTC AAT GOO GTC TCC OSG GOT GTC CCC 264 

EBLAOLPBYLSKTSKYLTDS X08 

OAG GAT CTO OCT GGA CTT CCA GAG TAC CTG AGC AAG ACC AGC AAG TAG CTC ACC GAC TOO 324 

BYT EOSTOICTCLMKAVLNtK 128 

GAA TAC ACA GAG OOC TCC ACA GGT.AAG ACQ TOC CTO ATO AAG OCT OTO CTO AAC CTT AAG 384 

DG VMACILPtLQiDRDSGNP 148 

OAC OGA OTC AAT GCC TGC ATT CTO CCA CTO CTC CAG ATC GAC AGG GAC TCT GGC AAT OCT 444 

QPLVHAOCTDOV^R^J HSALH l€e 

CAG CCC CTC OtA AAT GCC CRG TOC ACA GAT GAC TAT TAC OSA GGC CAC AGC OCT CTO CAC 504 

lAXEKRStOCVKLLVE MGAN 188 

MC OCC ATT GAG AAG AGO AOT CTO CAO TOT OTG AAG CTC CTO OTO GAO AAT OGG GCC AAT 564 

V HARACORPPQKO0OTCFTP 20e 

OTO CAT OOC 09G OOC TOC OGC OOC TTC TTC CAO AAA OOC C3A OGG ACT TOC TTT TAT TTC €24 

O BLPl.«I.A ACT'KQ tlDV VBTI* 228 

OOT OAO CTA COC CTC TCT TTO OOC OCT TOC ACC AAG CAG 1X30 OAT OTO Om ADC TAC CTC 684 

LBHPHOPAfiLQA.TOSQOHT V 248 

CTO OAG AAC OCA CAC CAO OOC OOC AOC CTO CAO OOC ACT OAC IOC GAG OOC AAC ACA OTC 744 

IiHA LVKISDHS ABHIA LVTS 268 

CTO CAT OOC cm OTO ATO ATC tOO OAC AAC TCA OCT GAO AAC ATT OCA CTO OTO AOC AGC 804 

H TDOLLOAOARI<CPTV QLBD. 288 

ATO TAT OAT 009 CIC CTC GAA OCT OOO OOC 030 CTC TOC OCT ACC OTO CAO CZT OAS OAC 664 

XRHL0I>I*TPtKl»AAKBOKIB 308 
AlC OOC JUIC Cro CAO OAT CTt: AGO CCT CTO AAC CTO <Kr CKX: Wtf^ 924 

jFRHlhQ^SfSO hSHhSRKF 328 

ATT TTC AGO CAC ATC Cro CAC CGG GM rrr TCA CK» Cro AGC CAC CTT 984 

TEHCrOPVR VSLYDI'ASVDS 346 

ACC QWI TOO W TAT 000 <nt: COO OTO TC» CTO TAT GAC CTO 1044 

CBEMSVLEIIAFHCKSPHRH 3(6 

TOT OAO GAO AAC TC» (OT €TO GAG ATC AIT <KX TTT CAT TGC AAG AOC OOG CAC 1104 

RMVVLEPLMKLLQAKWDLLI 366 

COAATCOTCGrrTTCQAOCCCCTOAACAAAOTCroCAaGOGAAATOGGATCTOCTCA 11€4 

PKFPLNF LCNLIVMPIFTAV 408 

CCC AAG TTC rrC TTA AAC rrC CTO TOT AAT CTG ATC TAC ATO TTC ATC TTC ACC OCT GIT 1224 

AYHQPTLKKOAAPHLKAEVO 428 

GCC TAC CAT CAO OCT ACC CTC AAG AAG CAG GCC GCC CCT CAC CTC AAA GCG GAG GTT GGA 1264 

H SMLLTOHILILLGOIYLL V 448 

AAC TCC ATC CIX3 CTC ACG GGC CAC ATC CIT ATC CTC CTA OGG GOG ATC TAC CTC CTC GTC 1344 

rtftiWYPHRRHVFlWlfi^'IDS 468 

GGC Cis CTO TOO TAC TTC TGG COG CGC CAC CTO rrc ATC TOG ATC TCG TTC ATA GAC AOC 1404 

YPEILFLPQALLTVV. SQVLC 468 

Jci^TGLATCCTCrrCCTCrrCCAGGOCCTCCTCACAGTO<m;TCCCAGG^ 1464 

TOcJ^0«ATO<MTOGTACCroCCCCroCTTGTC?CTGOG^ 1S24 

«l.I.YYTROFQHTOlYCVMIQ 528 

ALc^C?rJci^TAoSTGGCTTOCA0CACACAO0CATCTACACX<m:ATOATOCA^ 1564 

JLg^atoc^c5o<^cc^c^oLt?cc5^™atoiIc^^^ I'l! 

OEAHRP EAPTG "SCS 

«*,«oDMBGQBDBOHOA 568 

* « , n V B a H V X. L t t L A €26 

oJc*LjJia^oScJtoa»ci«wo«ocjm:TWJ«cio<^ »«« 

T V 0 T r P D 0 P D B R * C F R V 8 B 708 
Partial human VR2 alternate form 

Input file frhobl2c4.seq; Output File frhobl2c4.tra 
Sequence length 1469 

ORFPQKOQOTCFYFOEL'PL is 

OC OOC OQC TTC TTC CAO MG 03C CAA OOG ACT TOC TTT TAT TTC OOT GAO CTA CCC CTC SI 

SLAACTKQHDVVSYLLEKPH 39 

TCT TTG GCC GCT TGC ACC AAC CAO TOO GAT CTO OTA AGC TAC CTC CTO GAG AAC CCA CAC 117 

QPASLQATOSQGNTVLHALV 59 

GAG CCC OCC AOC CTC GAG GCC ACT CAC TCC CAO OOC AAC ACA QTC CTO CAT GCC CTA GTO 17? 

MISDMSA EN IALVTSMYDGL 79 

ATG ATC TOO (SAC AAC TCA OCT GAG AAC ATT GCA CTC GTG ACC AOC ATG TAT GAT GOG CTC 237 

LOAOARLCPTVOLEDIRN LQ 99 

CTC CAA GCT GGG GCC OGC CTC TGC CCT ACC GTO CAO CTT GAG OAC ATC OOC AAC CTO CAG 297 

DLTPLKLAAK.EGKIEIFRHI lis 

OAT CTC ACQ CCT CTO AAG CTG GCC GCC AAO GAG GOC AAG ATC GAO ATT TTC AGG CAC ATC 357 

LOREFSGliSBtSRKPTBHcy 139 

CTG CAG CX3G GAO TXT TCA OGA CTO AOC CAC CIT TCC OSA AAO TTC ACC GAO TGG TGC TAT 417 

OPVRVSLYDL AEVOSCEElfS 159 

OOG CCT ore OGG GTG TOO CTG TAT OAC CTO GCT TCT .OTS GAC AOC TOT GAG GAG AAC TCA 477 

VLBIIAFBCKfiPHRHRMVVL 179 

GTO CTG GAO ATC AIT GOC ITT CAT TGC AAO AOC COS CAC OSA CAC OQA ATS GTC GTT TTG S37 

EPLN|CZiI.Q AKNOt.LIFKFFL 199 

GAO CCC CTO AAC AAA CTC CTO CAC GOO AAA TOO GAT CTG CTC ATC COC AAO TTC TTC TTA 597 

NPLCHLXYt4FlPTAVAYHQP 219 

AAC TTC CTO TOT AAT CTG ATC TAC ATO TTC ATC TTC ACC GCT OTT OCC TAC CAT CAO CCT €57 

TL .KKQAAPBLKAEVONSHLL 239 

ACC CTO AAO AAO CAO GOC OCC CCT CAC CTO AAA OOG GAO GTT OOA AAC TCC ATS CTO CTQ 717 

TOHILILLGGIYLI.VO QZ.HY 259 

AOG OOC CAC ATC CTT ATC CTO CTA G03 000 ATC TAC CTC CTC OTS OOC CAO CTO TOO TAC 777 

-P- Hlt RBVPXIfX SPZOflTPSXL - 379" 

TTC1OOO9OCOCCACOXaTXCAXClO9JTOTaiTTCAXAaAeA0C13lC^ 637 

PLPQALLTVVSQV tiCPL A Z B 299 

TTC CTO TTC CAO OOC CTO CTC Ad OTS OTS TOC GAO OTS CTO TOT TTC CTO OOC ATC GAO 897 

NTLPLL VEAIiVLOirLNLLTY 319- 

TOO TAC CTO OOC CTO CTT GTO TCT OOS CTS GTO CIO OOC TOO CTO AAC CT3 CTT TAC TAT 957 

TROFQBTOZrOVHX. QKK AZfi 339 
ACA qOT OOC TTC GAO GAC ACA OOC AXC TAB AOT OTC ASG ASC OUI AAO AAA GOC AIC TCT 1017 

V. LBHBHOTNVCftKKQRAGVH 359 

OTC era GAO AZO GAO AAT OOC TAT TOO TOO TOC MG AAG AM CAO GOO OCA GOT 0T9 ATO 10.77 
Partial rat VR2 

Input file Flrxbl479ll.seq; Output File Flrxbl47gll.tra 
Sequence length 1794 

STHASALfifcAACTKQWOVV 

G TOO ACC OIC OOC TCC OCT CTT TCT CTO OCT COS TC3C ACC AAG CAO TOO OAT 0T3 OTO 57 

TYLLENPHOPASLBATDSLC 39 

ACC TAC CTC CTO GAG AAC CCA CAC GAG CCG GCC AGC CTG GAG OCC ACC GAC TCC CTG GGC 117 

NTVLHALVHIADNSPENSAL 59 

AAC ACA CTC CTO CAT OCT CTG CTA ATG ATT GCA GAT AAC TOO CCT GAG AAC AOT GCC CTG 177 

VIHMYDG LLOMGARLCPTVQ 79 

CTC ATC CAC AIX! TAC GAC GOG CTT CTA CAA ATO OGO COG OOC CTC TGC CCC ACT GTG GAG 237 

LEEISHHQOLTPLKLAAKEG 99 

CTT GAG GAA ATC TCC AAC CAC CAA GGC CTC ACA CCC CTG AAA CTA GCC GCC AAG GAA GGC 297 

KIEIFRHILORBFSGPYQPL 119 

AAA ATC GAG ATT TTC AGG CAC ATT CTG CAG CGG GAA TTC TCA GGA CCG TAC CAG CCC CTT 357 

SRKFTBMCYOPVRVfiLYDLS 139 

TCC OGA AAG rPT ACT GAG TX» TGT TAC GGT OCT GTG OGO GTA TOG CTO TAC GAC CTO TCC 417 

SVDSWEK NSVLEIIAFHCKS 159 

TCT GTG CAC AGC TOG GAA AAC AAC TOO GTG CTO GAG ATC ATC OCT TTT CAT TOC AAG AGC 477 

PMRHRHVVLEPLHKLLQEKW 179 

000 AAC 000 <aC OOC ATO CTO CTT TTA GAA CCA CTG AAC AAG CTT CTG CAO GAG AAA TOG 537 

DR LVfiRPFPHPACYLVYMFI 199 

GATOOGCTCGTCTCAACATTCTTC TTCAAC TTC GCC TOC TAC TTG GTC TAC ATG TTC ATC S97 

FTVVAYHQPfil-DOPAIPSSK 219 

• TTC ACC ore GTT OCC TAC CAC CAG CCT TCC CTO GAT CAG OCA OCC ATC COC TCA TCA AAA €57 

ATFOBSMLLLOBIL ILLGGI 239 

000 ACT m 000 GAA TCC ATO CTG CTG CTO GGC CAC ATT CTG ATC CTG CTT GOO GOT ATT 717 

YLLLOQLKTPHRRRI'PIWIS 259 

TACCrCTTACTOOOCaWCTGTOGTACTrTTOOOOaOOOOOCCTOTTTATCTOOATCTC^ 777 

F H D fi- T" P -E I L F L I. 0 A L L T V L fi 279 

TTCXTOOACAOCTWTTTOAAASCCTCTTTCPCOTCROOCrCrOCrCAtt 837 

CROOlOCIOOOCTICATOGMACrCAATOaTRC CDl^OOC CTO CTA OTO TBI TOC CTA OTO aP7 

LOMLHLLYTTROFQHTOITS 319 

crooacTOocroAACcwcw™:TO:ACAC(»<wcrrTCAOCRCA 957 

VHlQXVXhVLPhhVifhhVrLy 339 

OTCjaoAicawaAMcrrcAScciToaAGACCPOcrcoorTrcciociooT^ ioi7 

PLFOPAVALVfitfi«tBARflPK 359 

TTOCiTTreoocrfTocr<nAoa:ciAomji3CTTOAflCAaAGAoaocoaAAOTCocAA^ 1077 
GAP of: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

to: humanvr1.pep check: 6877 from: 1 to: 839 
humanVR1 _Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum. 

Symbol comparison table: 
/dc3m_local/gcg/gcg_9.1/gcgcore/data/rundata/blosum62.cmp 

CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1530 Length: 850 

Ratio: 2.003 Gaps: 10 

Percent Similarity: 55.378 Percent Identity: 46.348 

Match display thresholds for the alignment(s): 
| = IDENTITY 
: = 2 
. = 1 
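
The Ratio value in the GAP report above is consistent with dividing Quality by the length of the shorter sequence. The sketch below checks this under that assumption; the helper name `gap_ratio` is hypothetical and is not part of the GCG package.

```python
def gap_ratio(quality: float, len_a: int, len_b: int) -> float:
    """Quality normalised by the shorter sequence length.

    Assumption: this mirrors how the GAP report derives its "Ratio" line.
    """
    shorter = min(len_a, len_b)
    if shorter <= 0:
        raise ValueError("sequence lengths must be positive")
    return quality / shorter


if __name__ == "__main__":
    # Values taken from the report: Quality 1530, humanvr2.pep of 764
    # residues, humanvr1.pep of 839 residues.
    print(round(gap_ratio(1530, 764, 839), 3))
```

With the figures from this report the result rounds to the reported Ratio of 2.003.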

humanvr2.pep x humanvr1.pep 

T ' ' ! MTSPSSSPVF 10 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

• • * * * 

11 RLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 60 

: : Ml : • * ' 

51 GKGDSEEAFPVDCPHEEGELDSCPTI.TVSPVITIQRPGDGPTGARLLSQ 99 

61 RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEY 110 
100 DSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEF 149 

111 TEGSTGKTCLMKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDY 160 

: lllllhlMII II I I 11:1 I ... Mil M • ^„ 
150 KDPETGKTCLLKAMLNLHDGQNTTIPLLLEIARQTDSLKELVNASYTDSY 199 

« . • * • 

161 YRGHSALHIAIEKRSLQCVKLLVENGANVHARACGRFFQKGQG.TCFYFG 209 

hi .IMIIIhl-: I IHIIII.I I I I IM -I ^ 
200 YKGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFG 249 

♦ 

210 ELPLSLAACTKQWDVVSYLLENPHQPASLQATDSQGNTVLHALVMISDNS 259 

till Mint I :l :ll:l I I : I M 1 1 1 Ml H I • • H • _ 
250 ELPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNT 299 

260 AENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKLAAKEGKIEI 309 

|:| Mill. :l 11:1 II.. II:: I * :lll Ml Ml : 
300 ADNTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGV 349 
310 FRHILQREFS..GLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEI 357 

. II II I I III Hill INI Mill, :l.ll•lllll•• 

350 LAYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEV 399 

358 IAF.HCKSPHRHRMVVLEPLNKLLQAKWDLLIPK.FFLNFLCNLIYMFIF 405 

II: ..Ml I. ..1111:111 Ml : : h Ml :IJ II 
400 IAYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIF 449 

406 TAVAYHQPTLKKQAAPHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFW 455 

I I I:. I I .:l. .11 M .111:1 : II 

450 TMAAYYRPV..DGLPPFKMEKIGDYFRVTGEILSVLGGVYFFFRGIQYFL 497 

456 RRHVFIWISFIDSYFEILFLFQALLTVVSQVLCFLAIEWYLPLLVSALVL 505 



I . 1:111 i.M I.I . ■ 'I i..::J.;..J_L;J.j 

498 QRRPSMKTLFVDSYSEMLFFLQSLFMLATVVLYFSHLKEYVASMVFSLAL 547 

506 GWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLLIYLVFLFGFAVALVS 555 

II hlMMIII lll.lihl.lMII II: --I: I- I- I- 

548 GWTNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAVVT 597 

556 LSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLELFKFTI 605 

... , , . I I . I : I I II II I I 

598 LIEDGKNDSLPSESTSHRWRGPACRPPD....SSYNSLYSTCLELFKFTI 643 

606 GMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATD 655 

|||:l II I: . ::llllll:IIIIIIIIMII I : I ; 

644 GMGDLEFTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQE 693 

656 SWSIWKWKAISVLEtffiNGyvmC.WCKQRAGVta.TVGTKPlX3SPDERWCF 704 

I IIIM-M :l: I : III |.l :l II Ml Mill 
694 SKNIWKLQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCF 743 

705 RVEEVNWASWEQTLPTLCEDPSGA.GVPRTLENPVLASPPKEDEDGASEE 753 

I I'll II .1 • 'IN II III • ' • „. 

744 RVDEVNWTTWNTNVGIINEDPGNCEGVKRTLSFSLRSS RVSGRHWK 789 

754 NYVPVQLLQSN 

790 NFALVPLLREASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 
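
The Percent Identity figure reported for a pairwise alignment such as the one above can be approximated from the aligned rows themselves. The following Python sketch is illustrative only: the function name `percent_identity` is hypothetical, and it assumes that columns containing a gap character (`.` in GAP output) are excluded from the count, which may differ in detail from the GCG implementation.

```python
def percent_identity(row_a: str, row_b: str, gap_chars: str = ".-") -> float:
    """Percentage of identical residues over columns without a gap.

    Both rows must come from the same alignment and have equal length.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(row_a.upper(), row_b.upper()):
        if a in gap_chars or b in gap_chars:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # One 50-column block of the human VR2 / human VR1 alignment
    # (VR2 residues 61-110 over VR1 residues 100-149).
    vr2 = "RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEY"
    vr1 = "DSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEF"
    print(f"{percent_identity(vr2, vr1):.1f}% identity in this block")
```

A single block only gives a local figure; the value in the report is computed over the full alignment.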
GAP of: humanvr2.seq check: 8853 from: 1 to: 2809 
humanVR2 2l€lla, 2809 bases, 8853 checksum. 

to: humanvr1.seq check: 4554 from: 1 to: 3909 

humanVR1 Fbhl8547pat - Import - complete 

Symbol comparison table: 
/dclm_local/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp 
CompCheck: 8760 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 14359 Length: 3934 

Ratio: 5.112 Gaps: 15 

Percent Similarity: 55.316 Percent Identity: 55.316 

Match display thresholds for the alignment(s): 
| = IDENTITY 
: = 5 
. = 1 

humanvr2.seq x humanvr1.seq 



1 GGCTAGCCTGTCCTGACAGGGGAGAG 26 

I I I I I M lit 

801 TGTCCACAGTAGTCCCCCCTTATCCACGGGTGTCACTTTCCATGGGTTCA 850 

27 TTAAGCTCCCGTTCTCCACCGTGCCGGCTGGCCAGGTGGGCTGAGGGTGA 76 

Mill! III! II 1 III II 

851 GTTATTTGCGGTCAACCACGGTCTGCCAATATTAAATGGAAAATTCTTCA 900 
■ * • • ■ • 

77 CCGAGAGACCAGAACCTGCTTGCTGGAGCTTAGTGCTCAGAGCTGGGGAG 126 

II III I t I tl I I II Ml I I II 
901 AACAGTTCCCAAGTTTTCCCTTGTGCATTGTTCTGAGCAGTGTGATGAAG 950 

' 

127 GGAGGTTCCGCCGCTCCTCTGCTGTCAGCGCCGGCAGCCCCTCCCGGCTT 176 

I MM I .11 I I Ml I I 

951 AGTCTCTGCCGTGCCaVTCTGGGATGCAAACCGTCCCTGTGTCCCCCACGX 1000 

177 CACTTCCTCCCGCAGCCCCTGCTACTGAGAAGCTCCGGGATCCCAGCAGC 226 

I II Mi II Ml Ml It I I I 

1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCA6ATCGCC 1050 

227 CGCCACGCCCTGGC CTCAGCCTGCGGG 253 

-,111111 M HI I I 

1051 CGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 
254 GCTCCAGTCAGGCCAACACCGACGCGCAGCTGGGAGGAAG 293 

1 I I I I t III I I I I I t I M I 

1101 GATCCAGCAAGGATGAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC XI 50 

294 ..'..ACAGGACCCTTGACATCTCCATCTGCACAGAGGTCCTG 331 

I Mill Mil tl "I I Ml Ml 
1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 

332 GCTGGACCGAGCAGCCTCCTCCTCCTAGGATGACCTCACCCTCCAGC..T 379 
III t M I II I I M I M Ml 

1201 ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 1250 

380 CTCCAGTTTTCAGGTTGGAGACATTAGATGGAGGCCAAGAAGATGGCTCT 429 

It I I I M I I t I I t l!l I 

1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

430 GAGGCGGACAGAGGAAAGCTGGATTTTGGGAGCGGGCTGCCTCCCATGGA 479 
I II I I I I I II I I I 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 1350 

480 GTCACAGTTCCAGGGCGAGGACCGGAAATTCGCCCCTCAGATAAGAGTCA 529 

I I Ml MM I IM t -IM I I I 
1351 CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGG..C 1398 

530 ACCTCAACTACCGAAAGGGAACAGGTGCCAGTCAGCCGGATCCAAACCGA 579 

11 I I III M II I I I I 

1399 TGCTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTC 1448 

580 TTTGACCGAGATCGGCTCTTCAATGCGGTCTCCCGGGGTGTCCCCGAGGA 629 

I Itl It I MM I II II I I ) t t ' l"< 

1449 TATGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGA 1498 

630 TCTGGCTGGACTTCCAGAGTACCTGAGCAAGACCAGCAAGTACCTCACCG 679 
,,111 1)11 Mill M M It M II M II M I 
1499 TCTG<kGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACA^ 1548 

680 ACTCGGAAtAcACAGAGGGCTCCACAGGTMGACGTGCCTGATG^ 729 
n II t II Ml Mill MIM M IM MM II 

1S4 9 aCAACGAGTTCAAAGACCCTGAGACAGGGAAGAC^^^ 1S98 

730 GTGCTGAACCTTAAGGACGGAGTOU^TGCCTGCATTCTGCC^ 779 

1 1 M M I n M M n I M M MM I M M M 
1599 ATGCTCAACCTGCACGACC^CAGAACACCACCATC^^ 1646 



629 

1^98 

679 



760 GATCGAOVGGGACTCTGGCAATCCTCAGCCCCTGGTAAATi^^ 

IMM M I I Mt I M M tl II Ml I M 
1649 GATCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGC^ 

830 cagatgactAttaccgagg^cacagcgctctg^^ 

III MMM lIMM It MMMMIIMII MM 
1699 icGACftGCTACTACAAGGGCCAGAC^ 

880 AGGAGTCTGCAGTGTGTGAAGCTCCTGGTGGAGAATGGGGCCRATG^ 929 

II M Hit I M I M M M II II M M M M I 

1749 cGCAACATGGCCCTGGTGACCCTOrTGGTGGAGAACG^ 1798 

930 tGCCCGGGCCTGCGGCCGCTTCTTCCAGaAgGGCCAAG...GG^^ 976 
H nil It MMM Mil MM MMM 
1799 gGCTGCGC^CCATGGGGACTTCTTTAAGAAAACC^ 1846 
977 TTTATTTCGGTGAGCTACCCCTCTCTTTGGCCGCTTGCACCAAGCAGTGG 1026 
1 II tlllllll II mil II lllllll llllllll III I 
184 9 TCTACTTCGGI'GAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTG 1898 

1027 GATGTGGTAAGCTACCTCCTGGAGAACCCACACCAGCCCGCCAGCCTGCA 1076 

t I It I I III Ml llllt I IN lilt II 
1899 GGCATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAG 1948 

1077 GGCCACTGACTCCCAGGGCAACACAGTCCTGCATGCCCTAGTGATG ATCT 1126 

Nil llllt .IMItllll It Mill Kill III II 
194 9 CGCCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGG 1998 

1127 CGGACAACTCAGCTGAGAACATTGCACTGGTGACCAGCATGTATGATGGG 1176 

I llllll I ft II MM I MM! MIMMI Ml I 

1999 CCGACAACAC6GCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAG 2048 

1177 CTCCTCCAAGCTGGGGCCCGCCTCTGCCCTACCGTGCAGCTTGAGGACAT 1226 

III llllll II Mill II I It I II I M I 

204 9 ATTCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCT 2098 

. • * • • 

1227 CCGCAACCTGCAGGATCTCACGCCTCTGAAGCTGGCCGCCAAGGAGGGCA 1276 

I MM I Ml I IMM Ml IMM It I III 
2099 CACCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGA 2148 

• • " • • 
1277 AGATCGAGATTTTCAGGCACATCCTGCAGCGGGAGTT TTCAGGA 1320 

llllll I I II IMM lltlllMI t I I 

2149 AGATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAG 2198 

1321 CTGAGCCACCTTTCCCGAAAGTTCACCGAGTGGTGCTATGGGCCTGTCCG 1370 

M Mill III I IIMMIIMlttll Ml Mill II I 
2199 TGCAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCA 2248 

, • • • • 

1371 GGTGTCGCTGTATGACCTGGCTTCTGTGGACAGCTGTGAGGAGAACTCAG 1420 

MM Ml I I I MM Ml III lllltll I 

2249 CTCCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGG 2298 

1421 TGCTGGAGATCATTGCCTTTCATTGCA. . •AGAGCCCGCACCGACACC6A 1467 

Ml I M MM Ml Ml III I M Ml 

2299 TGCTGGAGGTGATCGCCTACAGCAGCAGCGAGACCXX:TAATCGCCACGAC 2348 

, . • • * 

1468 ATGGTCGTTTTGGAGCCCCTGAACAAACTGCTGCAGGCGAAATGGGA. • • 1514 

Ml M I MMMI IMIM III lllllll II IMM 
2349 ATGCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAG 2398 

1515 TCTGCTCATCCCCAAGTTCTTCTTAAACTTCCTGTGTAATCTGATCTACA 1564 

I Ml I M MM Ml IIMillll I I MM 
2399 ATTCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACA 2448 

, ■ • • • • 

1565 TGTTCATCTTCACCGCTGTTGCCTACCATCAGCCTACCCTGAAGAAGCAG 1614 

II MIIMMIM I MMMI I I Ml M 

2449 TGATCATCrTCACCATGGCTGCCTACTA....CAGGCCCGTGGATGGCTT 2494 

• • • • 

1615 GCCGCCCCTCACCTGAAAGCGGAGGTTGGAAACTCCATGCTGCTGACGGG 1664 

III III I I II I I Mill III II IMM 
2495 GCCTCCCTTTA,.AGATGGAAAAAATTGGAGACTATTTCCGAGTTACTGG 2542 

1665 CCACATCCTTATCCTGCTAGGGGGGATCTACCTCCTCGTGGGCCAGCTGT 1714 

I I I I I I t I I I I I 1 I I I I I I t t I I I II 

2543 AGAGATCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTC 2592 

1715 GGTACTTCTGGCGGCGCCACGTGTTCATCTGGATCTCGTTCATAGACAGC 1764 

ill III tl I I I till It I III t Itlllt 
2593 AGTATTTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGC 2642 

1765 TACTTTGAAATCCTCTTCCTGTTCCAGGCCCTGCTCACAGTGGTGTCCCA 1814 

IN III 11 II. .1.11 I I III I Ml Ml Mi M 
264 3 TACAGTGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGT 2692 

1815 GGTGCTGTGTTTCCTGGCCATCGAGTGGTACCTGCCCCTGCTTGTGTCTG 1864 
lllltIM III I 11 II Ml Ml I II I 

2693 GGTGCTGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCT 2742 

18 65 CGCTGGTGCTGGGCTGGCTGAACCTGCTTTACTATACACGTGGCTTCCAG 1914 

I MM MIIIIM III Mil Mill M II II llllll 
27 43 CCCTGGCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAG 2792 

• • • ■ 

1915 CACACAGGCATCTACAGTGTCATGATCCAGAAGGTCATCCTGCGGGACCT 1964 

M I llllllll MIIIIM Mill I MUM I Mill 
2793 CAGATGGGCATCTATGCCGTCATGATAGAGAAGATGAtCCTGAGAGACCT 2842 

• • • " * 

1965 GCTGCGCTTCCTTCTGATCTACTTAGTCTTCCTTTTCGGCTTCGCTGTAG 2014 

I li III I I Mill I llllll i Mill II t M 

284 3 GTGCCGTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAG 2892 

2015 CCCTGGTGAGCCTGAGCCAGGAGGCTTGGCGCCCCGAAGCTCCTACAGGC 2064 

I llllll MM I II I I I MM 

2893 CGGTGGTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAG 2942 

• • • • * 
2065 CCCAATGCCACAGAGTCAGTGCAGCCCATGGAGGGACAGGAGGACGAGGG 2114 

111 1 Ml t III I I II 

2943 TCCA CGTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCC 2980 

2115 CAACGGGGCCCAGTACAGGGGTATCCTGGAAGCCTCCTTGGAGCTCTTCA 2164 

I II I lilt II Ml I lllllll MM 

2981 CCCCGATAGCTCCTACAACAGCCTGTACTCCACCTGCCTGGAGCTGTTCA 3030 

2165 AATTCACCATCGGCATGGGCGAGCTGGCCTTCCAGGAGCAGCTGCACTTC 2214 

I IIMMIIMMIIMIMI nil Ml Ml t Mill 
3031 AGTTCACCATCGGCATGGGCGACCTGGAGTTCACTGAGAACTATGACTTC 3080 

• • • • • 
2215 CGCGGCATGGTGCTGCTGCTGCTGCTGGCCTACGTGCTGCTCACCTACAT 2264 

I I I I I ttllllllMIMI II I IIIMIIIIII 
3081 AAGGCTGTCTTCATCATCCTGCTGCTGGCCTATGTAATTCTCACCTACAT 3130 

• • • • • 
2265- CCTGCTGCTCAA»TGCTCATCGCCCTCATGAGCGAGA<XGTCAACAGTG 2314 

III IIMIMMMIIinilMIIIIIM I Hill IMUM 
3131 CCTCCTGCTCAACATGCTCATCGCCCTCATGGGTGAGACTGTCAACyA 3180 

• • • • 
2315 TCGCCACTGACAGCTGGAGCATCTGGAAGCTGCAGAAAGCCATCTCTGTC 2364 

MM M III M IIMMIIIMMMM lllllll I M 
3181 TCGCACAGGAGAGCAAGAACATCTGGAAGCTGC7VGAGAGCCATCACCATC 3230 
2365 CTGGAGATGGAGAATGGCTATTGGTGGTGCAGGAAGAAG...CAGCGGGC 2411 

lltll I llllll lit Mill II Mil Ml 

3231 CTGGACACGGAGAAGAGCTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTC 3280 

2412 AGGTGTGATGCTGACCGTTGGCACTAAGCCAGATGGCAGCCCGGATGAGC 24 61 

Ml I Mill II II I i Ml II I I 

3281 AGGCAAGCTGCTGCAGGTGGGGTACACACCTGATGGCAAGGACGACTACC 3330 

• • * * • ' 

24 62 GCTGGTGCTTCAGGGTGGAGGAGGTGAACTGGGCTTCATGGGAGCAGACG 2511 

I 11 I II I II I li I J M II M I I t I I I M I I I I II I I t 

3331 GGTGGTGCTTCAGGGtGGACGAGGTGAACTGGACCACCTGGAACACCAAC 3380 

2512 CTGCCTACGCTGTGTGAGGACCCG...TCAGGGGCAGGTGTCCCTCGAAC 2558 

II I I It llllll I I II Ml II II 
3381 GTGGGCATCATCAACGAAGACCCGGGCAACTGTGAGGGCGTCAAGCGCAC 34 30 

2559 TCTCGAGAACCCTGTCCTG GCTTCCCCTCCCAAGGAGGATGAGGAT 2604 

f I I I I I I M M II II 

3431 CCTGAGCTTCTCCCTGCGGTCAAGCAGAGTTTCAGGCAGACACTGGAAGA 3480 

2605 GGTGCCTCTGAGGAAAACTATGTGCCCGTCCAGCTCCTCCAGTCCAACTG 2654 

I I I Mi II I Ml II II 

3481 ACTTTGCCCTGGTCCCCCTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAG 3530 

2655 ATGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCA 2704 

It III M I I Mill Ml lit I I 

3531 TCTGCTCAGCCCGAGGAAG'rt'TATCTGCGACAGTTTTCAGGGTCTCTGAA 3580 

2705 ACCACATCTGCTGGCTCTGGGGTCCCAGTGAATTCTGGTGGCAAATATAT 2754 

III I I I Ml MM It I Ml I 

3581 GCCA GAGGACGCTGAGGTCTTCAAGAGTCCTGCCGCTTCCGGGGA 3625 

2755 ATTTTCACTAACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2804 

llllll till II I III 

3626 GAAGTGAGGACGTCACGCAGACAGCACT6TCAACACTGGGCCTTAGGAGA 3675 

2805 AAAAA 2809 

3676 CCCCGTTGCCACGGGGGGCTGCTGAGGGAACACCAGTGCTCTGTCAGCAG 3725 
CLUSTAL W (1.74) multiple sequence alignment 

humanVR2 MTSPSSSPVraLETUXX^OEDGSEADRGKLDFXSSGLPPMESQFOGEDRKFAPOIRVN^ 

rfttVR2 

humanVR2 RKGTCUlSQPDPNRFDRDRLFNAVSRGVPEDIAGLPEYLSKTSKYLTDSEyTEGSTGKl^ 

ratVR2 

humanVR2 MKAVlJn.KIX5VKAClLPLLQIDRDSGNPQPLVNA0CTDDyYRGHSAIJIIAIEKRSI^^ 

ratVR2 - - - - 

huinanVR2 LLVENGANVHARACGRFTOKG<Xn'CFyTOELPLSLAACTKQWDWSyLLENPHQPAS^^ 

ra t VR2 STHASALSLAACTKQWDVVTYLLENPHQPASLEA 



humanVR2 TDSQGNTVLHALVMISDNSAENIALVTSMYIX3UiQAGARIX:PTVQLEDIRNW 
ratVR2 TDSLGJnVLHALVMIADNSPENSALVIHira)GLIiQMGARIX:PTVQL^^ 



humanVR2 AAKEGKIEIFRHILQREFSG-LSHLSRKFTEWCyGPVRVSLYDLASVDSCEENSVLEIIA 
ratVR2 AAKBGKIEIFTUilLQREFSOPYQPI^SRKFTEHCYGFVRVSLYDIiSSVDSHBKHSVIiBXIA 



humanVR2 FHCKSPHRHRMVVI^PlJnaJiQAKKDIXIPKFF^nMn*IYMPIFTA^ 
ratVR2 FHCKSPNRHRMVVLEPimiMEKiroRLVSRrFIOTACT^ 

**«•♦*.«***«**«•**«*** «•« * *•«*«*«* ,*«**«*.*^ • « 

\ 

huinanVR2 APHLKAEVGHSHLLTCHltlLLGGIlOXVt^Iiim^^ 
ratVR2 IPSSfJiTFQZSmAJUSBlhlhUQOirUjl^^ 

ratVR2 hTVLSQmjm^ETBm^UjVX^hVlJSmi^ 

hufflanVR2 LIYLVFLF^FAVALVSXiSQBAiniPEAPTXSPKATBSVQPMBGQEDEG^ 
ratVR2 LVYLVFZ<FOFAVALVSIjSRBARSPK»BZ2ra7STVTEQPTVGQ^ -APYRSZLDASLB 

*.««****••******«•:** , «•« • * 

lu«aanVR2 XFKFTIGMGBLAFQBQXfiFRGKVXlIiZXI^^ 
ratVR2 i;FKFTiaM3EXAPQEQIAFR£WVUXIJA^^ 



huBiaxiVR2 WJjgiK^J€Vl£t€SlKmf^ 

ratVR2 HlOOlCAZSVZiBKBKOnmCKRKlS&BCna^ 



littmanVR2 PTIiCSDPSaAOVPim^NPVLASPPKBDED^ 

ratVR2 pnigKDPfiqPQITU N iU W PTCK'tKIK-*-KSASSKimiPI/m^ 
GAP of: ratvr2.pep check: S190 from: 1 to: 554 
ratVR2 rlrxbl47gll 

to: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

Symbol comparison table: /usr/local/gcg_9.1/gcgcore/data/rundata/blosum62.cmp 

CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 2182 Length: 766 

Ratio: 3.93S W= * 

Percent Similarity: 81.703 Percent Identity: 79.167 

Match display thresholds for the alignment(s): 
| = IDENTITY 
: = 2 
. = 1 

ratvr2.pep x humanvr2.pep 



1 STH^S^^IAAC^WDVVTm^ ** 

llllllllllll . Ill-iill MINI 

54 

300 



45 «.VMIAI«s;E«SAI.VnW^U«HaA^ 

lint III II III II MM I I MliH 

301 349 



ISVDSW 144 



145 BIWSVIiBIIAFHCTOPW»lR«vv**irA«~*^ II II I I 



294 



'*^?mmiiimrTiiiinii imii-iin 



295 



344 



I iMiimTiiiillllllilllllllllllillllll'tlllilll 



FIOTFE 8 
3,3 LFKrr:««Ei««EQU.™Gvyua^ 



539 SASEEDHLPLQVLQSP 554 
749 GASEEinfVPVQLWSH.764 
GAP of: humanvr1.seq check: 4554 from: 1 to: 3909 
humanVR1 Fbhl8547pat - Import - complete 

to: ratvr1.seq check: 7921 from: 1 to: 2847 

ratVR1.seq AF029310 in GenBank 

Symbol comparison table: 
/dSriocal/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp 

CompCheck: 8760 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 22717 Length: 3914 

Ratio: 7.979 Gaps: 10 

Percent Similarity: 82.125 Percent Identity: 82.125 

Match display thresholds for the alignment(s): 
| = IDENTITY 
: = 5 
. = 1 

humanvr1.seq x ratvr1.seq 

1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAG^CACTTAGTCGTCAGATC^^^ 1050 
_ _ CAGCTCCAAGGCACTTGCTCC 21 

22 ATiTGGGGiGTGCCTGCACCT.-.AGCTGGTTGCAAATTGGGCCACAGAG 68 
1101 GATCCAGCA^GGATGAAGA^TGGAGCAGCACAG^^^^ 1150 
69 iiiiTciAiiiiiiiGLvciiciGGCTAGCTTAGACTCAGAGGAGTC^^^ 118 

119 l^cllt^O^li^'^^^ 
1201 ACTCCAGGC^CCTCCAGci^AGCCCC^^ 1«0 

1251 ACCCGGCTCTTTGGGAAGG^TGACTCGGA^GGCTTTCCCGGTGGATTG 1300 
1351 CTSTtATCAicATCC«GA(:aa:»GG»CTCMCCC<aC^ "»» 
1401 CTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTCTA 1450 

iiiiiiiini III nil f null IN iiiiiiii 

369 TCATCCCAGGACTCCGTCTCCGCTGG...TGAGAAGCCCCCGAGGCTCTA OS 
1451 TGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGATC 1500 

1 1 1 M 1 1 1 1 1 II I II II II M II I n II n 1 1 1 II n II I II i 

416 iiUiGiAiGAGCAiciiciATGCTGTGGCTCAGAGT^^^ 165 
1501 TGGAGAGCCTGCTGCTCTTCCTGCAGAAgAgCAAGAAGCACCTCACAGAC 1550 

1 1 M I M 1 1 1 1 1 n I I II 1 1 1 1 1 n I I n 1 1 1 n 1 1 1 in n in 

1551 AACGAGTTCAAAGACCCTgAgACAGGGAAGACCTGTCTGCTG/^^^ 1600 

I iiiiiniiiniii iniiin nnninnni nixni 

516 LiiicTTiiAiiAicCAGAGACAGGAAAGACCTCTCTGCTAAAAGCC^ 
1601 GCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGA^^ 

I 1 1 1 1 1 1 I II I II 1 1 11 II II i I I II I II I n II 

566 icrcAATCTGCACAATGGGCAGAATGACACCATCGCTCTGCTCCTGGAC 

1651 tcgcgcggcAaacggacagcctgaaggagcttgtcaacgcca^^^ 
I II III I 11 II II n II 1 1 n n n n > > i n i n i ii 1 1 1 

616 !TiicCGGAiGA<!:AGACAGCCTGAAGCAGTTTGTCAAT^^^ 

1701 6ACAGCTAci-ACAAGGGCC;GACAG<^CTGCAC^^^ 

I I I M I I I I I I M i I t I I I I t t I t I M I I I M Mill 11 t M 

1751 CAACATGGCCCTGGTGACCCTCCTGGTGG^^^ 1^00 
Mini I IMMIIIIIM IMM III* IMHIN IIHMI 
716 ciiiiiGAiGiUGri^^^^ 

ctgcggcccAtggggacttctttaagaaa^cca;^^^ 1850 

815 



565 

1650 

615 

1700 

665 

1750 

715 



1801 CTGCGGCCCATGGGGAUX-11.1 i ^'^"'^77, ,";T7T : mum mi 
Mlllll 1 lllllllilll llll MIMIIMt IIIMIl til 

1851 TACTTCGGTGAACTGCCCCTGTCCCTG^^^ 

816 iiyiTUiUcii^iiiiiiiiciiicTa^ 865 
1901 CATCGTGAAGTTCCTGCTG^:AGAACTCCTGGCAGAC^^ 1950 

Ml 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ni 1 1 1 1 I M I I 1 1 t II It I II II 

1951 CCAGGGACTCGGTGGGCAA^:ACG6TGCTGCAC^^^^^ 2000 
M Mlllll M 1 1 n I t II I I I M I I t I If I I I I 1 I M I M 

2001 GACAACACG^CGACAACACGAAGTTTGTG^^^ 2050 
' ■ I t I f M i IIIIIIII M l Mil M M M I I I IMM 
966 iiTiiiii:AiTTiiiiiiiiciiiiTciWGCAT^^^ 1015 

2051 TCTGATGCT^GGG<KX3UU.CTGCACCCGAC^^ "00 

MM IIMMIMMIM Ml IMMIIIIIM Ml Ml 
1016 CTiGiicyciiGiiiiAiCTCCACCC^^^ 10« 
CCGCTGt 

iiiiii mill I inn iiiiitniii ni t i nun 



2101 CCMCMGAAGGCy^TGACGCCGCTGGCTCTGGCAGCTGGGACCGGGAAG 2150 



1066 CCAACAGGAAGGGGCTCACGCCACTGGCTCTGGCTGCTAGCAGTGGGAAG 1115 

2151 ATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAGTG 2200. 

,, I I I I 11 I I I I I I I I I I I II I I n I I II i n I I I I II I I I M I II 

1116 ATCGGGGTCTTGGCCTACATTCTCCAGAGGGAGATCCATGAACCCGAGTG 1165 

2201 CAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCACT 2250 

I 1 1 1 n I n n n 1 1 1 i • • " < " ' " " 

1166 CCGACACTTATCCAGGAAGTTCACCGAATGGGCCTATGGGCCAGTGCACT 1215 

CCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGGTG 2300 



V^V^iV-WX,-^ 

,111 II n iiiiiiiiiiiiii iiiMNi 11 iiii'i 

1216 CCTCCCTTTATGACCTGTCCTGCATTGACACCTGTGAAAAGAACTCGGTT 1265 
2301 CTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGACAT 2350 

I I I I III II inn iiimiiiii n n "iiii 

1266 CTGGA6GTGATCGCTTACAGCAGCAGTGAGACCCCTAACCGTCATGACAT 1315 
2351 GCTCTTGGT^GCCGCTgAaCCGACTCCTGCAGGACAAGTGGGACAGAT 2400 

2351 ecTi,! ^^11 iiiiniiiniiniiii 

1316 gCTTCTCGTGGAACCCTTGAACCGACTCCTACAGGACAAGTGGGACAGAT 1365 
2401 tCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTA(»^ 2450 

I iiiniiinniininniiiiii i nnnin nii ni 

1366 TTGTCAAGCGCATi 



■CTTCTACTTCAACTTCTTCGTCTACTGCTTGTATATG 1415 
>CCATGGCTGCCTACTACAGGCCCGTGGATGGCTTGCCTCC -2500 



2451 ATCATCTTCA* 

niini iiiiiiiiiin ii nm miihk <> 

1416 ATCATCTTCAl 



CCGCGGCTGCCTACTATCGGCCTGTGGAAGGCTTGCCCCC 14 65 



2597 
1565 
2647 
1615 



2501 CTTTAAGATaGAAAAAA..:TTGGAGACTATTTCCGAG^ 2547 

II nil II Mill nil niMiiniiin ii iimii 

1466 CTATAAGCTGAAAAACACCGTTGGGGACTATTTCCGAGTCACCGGAGAGA 1515 
2546 TCCTGTCTGiGTTAGGAGG^GTCTACTTCTTTTTCXrGAG^^ 

II ifiiiiiii iiMimiiiMmii """"""'1''' 

1516 TCTTGTCTGTGTCAGGAGGAGTCTACTTCTTOTOrGA^^ 

2598 TTCCTGCAG^GGCGGCCGT^TGAAGAaCTCTTTGTG^^ 

1 1 II II III IN 1 1 II II I HI! MliniMnlMIMM 
1566 iicCTGCAGAGGCGACCATCCCTCAAGAGTTTGTTTGTGGAW^^ 

2648 TGAGATGCTTTTCTTTCTG^AGTaCTGOTCATGCTGGC^AC^ 2697 

IIIIII iniiiiii I Mill iiiiiiiMiin I "III ' 

1616 iiiiAWCTTTTCTTTGTACAeTCGCTGTTCATGCTGGTGTCTGTGGTAC 1665 

2698 TGTACTTO^CACCTCAAGGAGTATGTGGCTTCCATGGTATTCTCCCTG 2747 

2698 TGTACTTCAGCCA^ |imillllll ">llll 

1666 iiiiyiiiiiiAAiGCAAGGAGTATGTG^^ 

2748 GCCTTGGGCTGGACCAACAkcTCTACTAaACCCGCGGTTTC^^ 2797 

III iiiiniiniitiiiiiiiiiiii mil '""""'1 

1716 iikTGGGCTGGACCAACATGCTCTACTATACCCa^^ ^^S 
2798 GGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCTGTGCC

II III! Illlllll II I Illllllllllll 

1766 GGGCATCTATGCTGTCATGATTGAGAAGATGATCCTCAGAGACCTGTGCC 

2848 GTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAGCGGTG 
I II Mill Mini MM Illlllll II IMIIIIIIII III 
1816 GGTTTATGTTCGTCTACCTCGTGTTCTTGTTTGGATTTTCCACAGCTGTG 



2847 
1815 
2897 
1865 
2947 



2898 GTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAGTCCAC 

Hill Illlllll I Mill INI Mill '""i'l 

1866 GTGACACTGATTGAGGATGGGAAGAATAACTCTCTGCCTATGGAGTCCAC 1915 

2948 GTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCCCCCCGATAGCTCCTACA 2997 

Mill MM IMIIMII I II I II Ml MM 

1916 ACCACACAAGTGCCGGGGGTCTGCCTGCAAG...CCAGGTAACTCTTACA 1962 

2998 ACAGCCTGTACTCCACCTGCCTGGAGCTGTTCAAGTTCACCATCGGCATG 3047 



llllll I II III lilllMMIIMMIMMM 

ACAGCCTGTATTCCACATGTCTGGAGCTGTTCAAGTTCACCATCGGCATG 

GGCGACCTGGAGTTCACTGAGAACTATGACTTCAAGGCTGTCTTCATCAT 
IIIIIIIM MIIMI III """'1''''! 



2013 GGCGACCTGGAGTTCACTGAG^^ 2062 

* • • " 

3098 CCTGCTGCTGGCCTATGTAATTCTCACCTACATCCTCCTGCTCAACATGC 3147 

nil I I Ml MIIMI MMIMMIMMMI II Mill MM 1 1 
2063 CCTGTTACTGGCCTATGTGATTCTCACCTACATCCTTCTGCTCAACATGC 2112 



3148 



TCATC6CCCTCATGGGTGAGACTGTCAACAAGATCGCACAGGAGAGCAAG 3197 



MM 



'll nil MM! MIIMI MM Mill IHH''' 



2162 



,«,w.TCTGGAAGCTGCAGAGAGCCATCACCATCCTGGACACGGAGAAGAG 3247 

mTTmmmiiiiiiiiiimmiiiim I 

GiAAGCTGCAGAGAGCXATCACCATCCTGGATACAGAGAAGAG 2212 



2113 TCATTGCTCTCATGG6TGAGACCGTCAACAAGATTGCACAAGAGAGCAAG 

3198 AACA' 
MM 
2163 AACATCTi 

3248 CTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTCAGGCAAGCTGCTGCAGG 3297 
3248 CTTCCliAA , , , , , , , || | | | | | | | | | M I II M M H H M I M II I 
2213 iUiiioiiGXiiiGGiAiiiiiTCCGCTCT^ 2262 

3298 TGGGGTACA«CCTGATGG^GGACGRCTACC(MTGCT^ 3347 

I II III III iiiM MIMIII IIIMIIIMMM tHHIMI 
2263 iiiiGTTcicTCCTGACGGCAAGGATGACTACCGGTGGTGTTTCAGGGTC 2312 

3348 GACGAGGTGAACTGGAaaVCCTGGAAO^CC^^^^ 3397 

11111111 I MIIMI MiM Mini Mill inn linn 

2313 iiiiAGGTAAACTGGACTACCTGGAACACCAATGTGGGTAT^^ 2362 

3398 AGACCCGGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTTC 3447 

3399 *^^^^^,,,,,,,,,,,,,,,,|,|iMiiiiiiinMnn 

2363 GGACCCAGGciACTGTGAGGGCGT(»AGCGCACCCTGAGCTTCT^ 2412 
3... c»^„^0^=^^ 

2413 GGTCAGGCCGAGTTTCAGGGAGAAACTGGAAGAACTTTGCCCTGGTTCCC 2462 
3498 CTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAGTCTGCTCAGCCCGAGGA 3547 

FIGURE 9 (cont'd) 
3548 AGTTTATCTGCGACAGTTTTCAGGGTCTCTGAAGCCAGAGGACGCTGAGG

2513 iiiicAAciiAAGcATTATACGGGATCCCTTAAGCCAGAGGATGCTGAGG 2562 

3598 TCTTCAAGAGTCCTGCCGCTTCCGGGGAGAAGTGA.GGACGTCACGCAGA 3646 

I I I I I I I It I I I I I I I I 1 1 I I I I I I I M M 
2563 miiiAGGATTCCATGGTCCCAGGGGAGAAATAATGGACACTATGCAGG 2612 

3647 CAGCACTGTCAACACTGGGCCTTAGGAGACCCCGTTGCCACGGGGGGCTG 3696 
... I I III I I I I I M 

2613 GixciAii CGGGGTCTTTGGGTGGTCTG 2640 

3697 CTGAGGGAACACCAGTGCTCTGTCAGCAGCCTGGCCTGGTCTGTG^^^^ 3746 

I I I I I I M I I I Ml 11 II' I I I I I M M II 

2641 cixiGiiMC.CAGCAiGGTTGACGTTATCTGGGTCCACTCTGTGCCTGC 2689 

3747 CCA.GCATGTTCCCAAATCTGTGCTGGACAAGCTGTGGGAAGCGTTC^^ 3795 

I I I I I I M I 1 M I III I I t II M M I I I 
2690 iTiGi<l:icAiiiiTiGGAciTCGGCGGGCCTGCTGTGGGA^ 2738 

3796 GAAGCATGGGGAGTGATGT^CATCCAACCGTCACTGTC^^^ 3845 

I I I M M I I II II I It 1 I II M M 1 

2739 TGTiTGGGAATTiAGiiGiGTATCCAACCATGA...TCTCC^^^ 2785 

384 6 TCCTAACAG^CTTTCAGGT^TTTACTCACTTTACTAAAA;^^ 3895 
2786 GCTiTCAicTiiiiATGkciiiitiAiACAGAGTGAATGGCAAA^^ 2835 

3896 AGGGCGGCCGCTTA 3909 
2836 ACTTGGACACAT.. 2847 
GAP of: humanvr1.pep  check: 6877  from: 1  to: 839

humanVR1_Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum,

 to: ratvr1.pep  check: 5764  from: 1  to: 838

ratVR1 | AF029310 Rattus norvegicus vanilloid receptor subtype 1 mRNA,
complete cds.

 Symbol comparison table:
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/blosum62.cmp
CompCheck: 6430

         Gap Weight:     12      Average Match:  2.912
      Length Weight:      4   Average Mismatch: -2.003

            Quality:   3734             Length:    840
              Ratio:  4.456               Gaps:      3
 Percent Similarity: 89.247   Percent Identity: 86.022

        Match display thresholds for the alignment(s):
                    | = IDENTITY
                    : =   2
                    . =   1
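
The summary figures reported by GAP can be related back to the alignment itself. The following is a minimal Python sketch, not GCG's own implementation. It assumes that percent identity is taken over aligned columns in which neither sequence carries a gap. That assumption is at least consistent with the reported values: the alignment has 840 columns and 3 gaps, leaving 837 ungapped columns, and 720 identical positions out of 837 would give 86.022. The function name `percent_identity` and the choice of gap characters are illustrative assumptions; the two toy strings are residues 101-110 of the human and rat rows shown in the alignment.

```python
def percent_identity(seq_a: str, seq_b: str, gap_chars: str = ".-") -> float:
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in gap_chars or b in gap_chars:
            continue  # assumption: gapped columns are left out of the denominator
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Residues 101-110 of the human row against the matching rat columns (one gap).
human = "SVAASTEKTL"
rat = "SVSAG.EKPP"
print(round(percent_identity(human, rat), 3))
```

The same function applied to the full pair of aligned rows would be expected to reproduce the reported identity only if GAP uses the denominator assumed here.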



humanvr1.pep x ratvr1.pep

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 
I.. .11 .1 |...t I) I III il : I :II)MI 

1 MEQRASLDSEESESPPQEKSCLDPPDRDPNCKPPPVKPHIFTTRSRTRLF 



50 



50 



51 GKGDSEEAFPVDCPHEEGELDSCPTITVSPVITIQRPGDGPTGARLLSQD 100 

llllllll 1.111:111 I Ml till 1:111111111 I III 
51 GKGDSEEASPLDCPYEEGGIASCPIITVSSVLTIQRPGDGPASVRPSSQD 100 

101 SVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEFK 150 
IM II llllllltl:tlM.llhlMII 111:111 III. Ml 

101 SVSAG.EKPPRLYDRRSIFDAVAQSNCQELESLLPFLQRSKKRLTDSEFK 149 
• • • • • 

151 DPETGKTCLLKAMLNLHDGQNTTIPLLLEIARQTDSLKELVNASYTDSYY 200
MMMMMMIIMMM II Ml::ll.llill: MIMIIMI 

150 DPETGKTCLLKAMLNLHNGQNDTIALLLDVARKTDSLKQFVNASYTDSYY 199 



201 KGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFGE 250

IMMMIIMMII IIMMMIIIMMMMMIIIIIIIMIM 
200 KGQTALHIAIERRNMTLVTLLVENGADVQAAANGDFFKKTKGRPGFYFGE 249

♦ . • • • 

251 LPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNTA 300 

IMIIIIillM MMIIMIII MMIMMMMMMIMMM 
250 LPLSLAACTNQIAIVKFLLQNSWQPADISARDSVGNTVLHALVEVADNTV 299 

, . • • • 

301 DNTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGVL 350

MIIMMMIMMIMIIIIMMI:ll:ll:IMIIII .MUM 
300 DNTKFVTSMYNEILILGAKLHPTLKLEEITNRKGLTPLALAASSGKIGVL 349 

, • . • * 

351 AYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 400 

ttlllM MIIMIIIIMMIMIIIIIMMMIIMIIIM 

350 AYILQREIHEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 399 
401 AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIFT 450

IIIMIIIIIIIIIIIIIIIIIMIIIir HIM '••""'•l 

400 AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFFVYCLYMIIFT 449

451 MAAYYRPVDGLPPFKMEK.IGDYFRVTGEILSVLGGVYFFFRGIQYFLQR 499

I I II I 11:1111: I:. :lllllllllll .oo 

450 AAAYYRPVEGLPPYKLKNTVGDYFRVTGEILSVSGGVYFFFRGIQYFLQR 499



550 
550 
600 



549 



500 RPSMKTLFVDSYSEMLFFLQSLFMLATVVLYFSHLKEYVASMVFSLALGW

lll-l IIIIIIII.IIMIilll •IMIII IIIIIIIIIMIill 
500 ip^LKSLFVDSYSEiLFFVQSLFMLVSWLYFSQRKEYVASMVFSLAMGW 549 

TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAVVTLI 599

llllllllllllllllllllll 11111111:1 IIIIIMIIIIII 

TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYLVFLFGFSTAVVTLI 599 



649 



EDGKNDSLPSESTSHRWRGPACRPPDSSYNSLYSTCLELFKFTIGMGDLE 

II III I: II II: I .IIIIIIIIMNIIIIIIIIil) 

600 EDGKNNSLPMESTPHKCRGSACK.PGNSYNSLYSTCLELFKFTIGMGDLE 648 

650 FTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQESKNIWK

650 ,,,,,,,,,,,,11,11 II II 

649 riEiioFKAVFIILLIAYVILTYILLLKtaiAIJ^GET 

700 LQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCFRVDEVN

,,l,,||i|||t|||ttllllllllllllll:l<lllllllltlllllll 
699 iiiililLDTEKSEiKCMRKAFRSGKLLQVGFTPDGKDDYRWCFRVDEVN 

750 WTTWNTNVGIINEDPGNCEGVKRTLSFSLRSSRVSGRHWKNFALVPLLRE

llllllllllllllllllllilXIMIIII lllll.llllllllllh 
74 9 iiiJ^iiiGlINBDPGNCEGVKRTLSFSLRSGRVSGRNWKNFALVPLLRD 

800 ASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 

11 III . I III I: t.lll Mil III 

799 ASTRDRHATQQEEVQLKHYTGSLKPEDAEVFKDSMVPGEK 838 



699 
698 
749 
748 
799 
798 
CLUSTAL W (1.74) multiple sequence alignment



humanVR2.alt
humanVR2



MTS 



PSSSPVFIU^TUXXKiEIX3SEADRGKIJ>F<3SGLPPMESQFX)GEDRKFAPQIRVKLOT 



humanVR2.alt 
humanVR2 



RK1 



:GTGASQPDPNRn)RDRLFIlAVSRG\^EDIAGLPEyi^lCrSKyiiTDSEyTEGSTGK^ 



humanVR2.alt
humanVR2



MKAVIin.KTCVNACILPLWJIDRDSGNPQPLVrUQCTDDraGH^ 



humanVR2 alt qrfpqkGQGTCFYFXSELPLSLAACTKOWDWSYLLEKPHOPASLQA 

humaiiVR2 * LLVENGAimiARACGRFPQKGQGTCFyrcELPL^lAACT^^ 

humanVR2 . alt TOS<X3NTVIJIALVMISDNSAENIALVTSMyDGLWAGA^ 

humanVR2 'TOSQGrrrVI^VMISDNSAENIALVTSr^ 

humaiiVR2 .alt AAKEGKIEIFWlILQREFSGI^HI^RKrTEWCYGPVRVSLyDIJ^VDSCEE^^ 

humanVR2 AAKEGiaEIFWIIWREPSGI^HI^RKFTEWOTOPVRVSLYDIJ^VDSCEE^ 



humanVR2.alt
humanVR2



HCKSPHRHRMVVI^PI/naMAKraU-IPKPFDNFI/3^ 
HCKSPHRHRMVVLEPUiKIiLQAKm)U*I 



humanVR2.alt
humanVR2



PHIJCAEVGMSMIilXOTII.ILI>GGIYia-VGQIOTTOa^ 
PHUMVGMSMIiTGHILIIJ>GGIYU.VGQWmTaRR^ 



humaiiVR2.aXt TVVSQVLCFIJaEWyU>U.VSALVI/3WUtt^ 



humanVR2.alt 
humanVR2 



.FGFAVALVSI^QEAHRPKAPTGFMATESVQPMEOQEDBGNGAQYRGILKASLEL 



humanVR2.alt
humanVR2



humanVR2.alt
humanVR2



.-lOCAIgVIiEMBNOTWWCRKKQi^^ 
laXJKAlSVMMBOTYimCRKKQRW^^ 



humanVR2.alt
humanVR2



USDPfiGWWRTLBKIWUBPPimBDGASBBNyV^ 
ICT)PfiGMVPRTLniPVIASPPKBDEIX»SBEtff^^ 
[Figure: predicted structural features plotted against residue position (axis marked from 50 to 800 in steps of 50). Legible panel titles: Alpha Regions - Chou-Fasman; Beta Regions - Chou-Fasman; Turn Regions - Garnier-Robson; Turn Regions - Chou-Fasman; Coil Regions - Garnier-Robson; Hydrophilicity Plot - Kyte-Doolittle; Hydrophobicity Plot - Hopp-Woods; Alpha, Amphipathic Regions - Eisenberg.]
Protein Family / Domain Matches, HMMer version 2

Searching for complete domains

hmmpfam - search a single seq against HMM database
HMMER 2.1.1 (Dec 1998)
Copyright (C) 1992-1998 Washington University School of Medicine
HMMER is freely distributed under the GNU General Public License (GPL).

HMM file:      /prod/ddm/seqanal/PFAM/pfam4.2/Pfam
Sequence file: /usr/ns-home/docs/seqanal/orfanal/oa-script.18670.seq

Scores for sequence family classification (score includes all domains):
Model    Description      Score    E-value  N
ank      Ank repeat        51.5    1.9e-11  3

Parsed for domains:
Model    Domain  seq-f seq-t    hmm-f hmm-t      score  E-value
ank        1/3     201   233 ..     1    33 []    34.4  2.6e-06
ank        2/3     248   283 ..     1    33 []    13.2        2
ank        3/3     333   361 ..     1    33 []     3.4       26

Alignments of top-scoring domains:

ank: domain 1 of 3, from 201 to 233: score 34.4, E = 2.6e-06
                *->nGnTPLHlAarygnvevvklLLehGAdvnartk<-*
                   +G+T+LH+A + n+ +v lL+e+GAdv a   +
   hVR-1   201     KGQTALHIAIERRNMALVTLLVENGADVQAAAH     233

ank: domain 2 of 3, from 248 to 283: score 13.2, E = 2
                *->nGnTPLHlAarygnvevvklLLe...hGAdvnartk<-*
                    G  PL lAa ++++ +vk+LL+++ + Ad+ ar+
   hVR-1   248     FGELPLSLAACTNQLGIVKFLLQnswQTADISARDS     283

ank: domain 3 of 3, from 333 to 361: score 3.4, E = 26
                *->nGnTPLHlAarygnvevvklLLehGAdvnartk<-*
                   +G TPL lAa  +++ v ++ L+
   hVR-1   333     KGMTPLALAAGTGKIGVLAYILQ----REIQEP     361
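
The "Parsed for domains" table lends itself to a small worked example. The sketch below is a hypothetical helper, not part of HMMER: it assumes each table row has been cleaned to whitespace-separated fields in the order model, domain index, seq-f, seq-t, sequence flags, hmm-f, hmm-t, HMM flags, score, E-value. The names `DomainHit` and `parse_domain_line` are illustrative only.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    model: str
    index: int      # which domain this is (the "1" in "1/3")
    total: int      # total domains reported (the "3" in "1/3")
    seq_from: int
    seq_to: int
    hmm_from: int
    hmm_to: int
    score: float
    e_value: float


def parse_domain_line(line: str) -> DomainHit:
    """Parse one cleaned row of an hmmpfam 'Parsed for domains' table."""
    fields = line.split()
    index, total = (int(part) for part in fields[1].split("/"))
    # fields[4] and fields[7] are the '..' / '[]' boundary flags and are skipped.
    return DomainHit(
        model=fields[0],
        index=index,
        total=total,
        seq_from=int(fields[2]),
        seq_to=int(fields[3]),
        hmm_from=int(fields[5]),
        hmm_to=int(fields[6]),
        score=float(fields[-2]),
        e_value=float(fields[-1]),
    )


# The three ankyrin-repeat rows reported for hVR-1, as cleaned text.
TABLE = [
    "ank 1/3 201 233 .. 1 33 [] 34.4 2.6e-06",
    "ank 2/3 248 283 .. 1 33 [] 13.2 2",
    "ank 3/3 333 361 .. 1 33 [] 3.4 26",
]
hits = [parse_domain_line(row) for row in TABLE]
for hit in hits:
    print(hit.model, hit.seq_from, hit.seq_to, hit.seq_to - hit.seq_from + 1, hit.e_value)
```

Read this way, only the first domain (residues 201-233) has a small E-value; the second and third matches are weak.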
CLUSTAL W (1.74) multiple sequence alignment

humanVR2        MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY
hVR2.altFL      MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY

humanVR2        RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL
hVR2.altFL      RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL

humanVR2        MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK
hVR2.altFL      MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK
                ************************************************************

humanVR2        LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA
hVR2.altFL      LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA
                ************************************************************

humanVR2        TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL
hVR2.altFL      TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL
                ************************************************************

humanVR2        AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF
hVR2.altFL      AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF
                ************************************************************

humanVR2        HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA
hVR2.altFL      HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA
                ************************************************************

humanVR2        PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL
hVR2.altFL      PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL
                ************************************************************

humanVR2        TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLL
hVR2.altFL      TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQK------------
                ************************************************

humanVR2        IYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLEL
hVR2.altFL      ------------------------------------------------------------

humanVR2        FKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATDSWSIW
hVR2.altFL      ------------------------------------------------------------

humanVR2        KLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT
hVR2.altFL      ---KAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT
                   *********************************************************

humanVR2        LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN
hVR2.altFL      LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN
                ********************************************
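
The row of asterisks under each block of the CLUSTAL W alignment marks columns in which the aligned residues are identical. A minimal Python sketch of how such a line can be derived for two aligned rows is given below. It is an illustration under simple assumptions (two sequences only, "-" as the gap character), not CLUSTAL W's own code, and the function name `conservation_line` is hypothetical. The example rows are the first ten columns of the block in which hVR2.altFL resumes after its deletion.

```python
def conservation_line(row_a: str, row_b: str, gap: str = "-") -> str:
    """Return '*' under identical, ungapped columns and ' ' elsewhere."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    marks = []
    for a, b in zip(row_a, row_b):
        marks.append("*" if a == b and a != gap else " ")
    return "".join(marks)


human_vr2 = "KLQKAISVLE"
alt_fl = "---KAISVLE"
print(human_vr2)
print(alt_fl)
print(conservation_line(human_vr2, alt_fl))
```

Columns falling within the deletion in the alternative form receive no asterisk, which is why the marker line is shorter than the sequence rows in the blocks spanning the deleted segment.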
SEQUENCE LISTING
<110> MILLENNIUM PHARMACEUTICALS, INC. 

<120> NOVEL MEMBERS OF THE CAPSAICIN/VANILLOID RECEPTOR 
FAMILY OF PROTEINS AND USES THEREOF 

<130> MNI-062CP2PC 

<140> 
<141> 

<150> 60/108,322 
<151> 1998-11-13 

<150> 60/114,078 
<151> 1998-12-28 

<150> 09/258,633 
<151> 1999-02-26 

<150> 09/421,134 
<151> 1999-10-19 

<160> 20 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 3909 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1113)..(3629)
<400> 1 



gtgagcgcaa cgcactgcgg gcagtgagcg caacgcactg cgggcagtga gcgcaacgca  60
ctgcgggcag tgagcgcaac gcactgcggg cagtgagcgc aacgcactgc gggcagtgag  120
cgcaacgcac tgcgggcagt gagcgcaacg cactgcgggc agtgagcgca acgcacttgc  180
gggcagtgag cgcaacgcac tgcgggcagt gagcgcaacg cactgcgggc agtgagcgca  240
acgcactgcg ggcagtgagc gcaacgcact gcgggcagtg agcgcaacgc actgcgggca  300
gtgagcgcaa cgcactgcgg gcagtgagcg caacgcactg cgggcagtga gcgcaacgca  360
ctgcgggcag tgagcgcaac gcactgcggg cagtgagcgc aacgcactgc gggcagtgag  420
cgcaacgcac ttaatgtgag ttagctcact cattaggcac cccaggcttt acactttatg  480
cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac aggaaacagc  540
tatgaccatg attacgccaa gctctaatac gactcactat agggaaagct ggtacgcctg  600
caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc gaaaacacac ctctctgctg  660
tgggaagact gtgcaatggc acagccgcag agcttggttt gggaggttga agtgctctgg  720
ggagaattcg tagatcatcc tcagaaaagc cttgccctgg tgttctacca gaaaaacgtc  780
tcccaatcac ccagaaaagc tgtccacagt agtcccccct tatccacggg tgtcactttc  840
catgggttca gttatttgcg gtcaaccacg gtctgccaat attaaatgga aaattcttca  900
aacagttccc aagttttccc ttgtgcattg ttctgagcag tgtgatgaag agtctctgcc  960
gtgccatctg ggatgcaaac cgtccctgtg tcccccacgt ccaggccgta gatgctcccc  1020
gccggtcagt cacttagtcg tcagatcgcc cgtcctggta tcacagtgct tctgttcagg  1080


ttgcacactg ggccacagag gatccagcaa gg atg aag aaa tgg agc agc aca  1133
                                    Met Lys Lys Trp Ser Ser Thr
                                    1               5

gac ttg ggg aca gct gcg gac cca ctc caa aag gac acc tgc cca gac  1181
Asp Leu Gly Thr Ala Ala Asp Pro Leu Gln Lys Asp Thr Cys Pro Asp
        10                  15                  20

ccc ctg gat gga gac cct aac tcc agg cca cct cca gcc aag ccc cag  1229
Pro Leu Asp Gly Asp Pro Asn Ser Arg Pro Pro Pro Ala Lys Pro Gln
    25                  30                  35

ctc ccc acg gcc aag agc cgc acc cgg ctc ttt ggg aag ggt gac tcg  1277
Leu Pro Thr Ala Lys Ser Arg Thr Arg Leu Phe Gly Lys Gly Asp Ser
40                  45                  50                  55

gag gag gct ttc ccg gtg gat tgc ccc cac gag gaa ggt gag ttg gac  1325
Glu Glu Ala Phe Pro Val Asp Cys Pro His Glu Glu Gly Glu Leu Asp
                60                  65                  70

tcc tgc ccg acc atc aca gtc agc cct gtt atc acc atc cag agg cca  1373
Ser Cys Pro Thr Ile Thr Val Ser Pro Val Ile Thr Ile Gln Arg Pro
            75                  80                  85

gga gac ggc ccc acc ggt gcc agg ctg ctg tcc cag gac tct gtc gcc  1421
Gly Asp Gly Pro Thr Gly Ala Arg Leu Leu Ser Gln Asp Ser Val Ala
        90                  95                  100

gcc agc acc gag aag acc ctc agg ctc tat gat cgc agg agt atc ttt  1469
Ala Ser Thr Glu Lys Thr Leu Arg Leu Tyr Asp Arg Arg Ser Ile Phe
    105                 110                 115

gaa gcc gtt gct cag aat aac tgc cag gat ctg gag agc ctg ctg ctc  1517
Glu Ala Val Ala Gln Asn Asn Cys Gln Asp Leu Glu Ser Leu Leu Leu
120                 125                 130                 135

ttc ctg cag aag agc aag aag cac ctc aca gac aac gag ttc aaa gac  1565
Phe Leu Gln Lys Ser Lys Lys His Leu Thr Asp Asn Glu Phe Lys Asp
                140                 145                 150

cct gag aca ggg aag acc tgt ctg ctg aaa gcc atg ctc aac ctg cac  1613
Pro Glu Thr Gly Lys Thr Cys Leu Leu Lys Ala Met Leu Asn Leu His
            155                 160                 165



WO 00/29577

PCT/US99/26701



gac gga cag aac acc acc atc ccc ctg ctc ctg gag atc gcg cgg caa  1661
Asp Gly Gln Asn Thr Thr Ile Pro Leu Leu Leu Glu Ile Ala Arg Gln
        170                 175                 180

acg gac agc ctg aag gag ctt gtc aac gcc agc tac acg gac agc tac  1709
Thr Asp Ser Leu Lys Glu Leu Val Asn Ala Ser Tyr Thr Asp Ser Tyr
    185                 190                 195

tac aag ggc cag aca gca ctg cac atc gcc atc gag aga cgc aac atg  1757
Tyr Lys Gly Gln Thr Ala Leu His Ile Ala Ile Glu Arg Arg Asn Met
200                 205                 210                 215

gcc ctg gtg acc ctc ctg gtg gag aac gga gca gac gtc cag gct gcg  1805
Ala Leu Val Thr Leu Leu Val Glu Asn Gly Ala Asp Val Gln Ala Ala
                220                 225                 230

gcc cat ggg gac ttc ttt aag aaa acc aaa ggg cgg cct gga ttc tac  1853
Ala His Gly Asp Phe Phe Lys Lys Thr Lys Gly Arg Pro Gly Phe Tyr
            235                 240                 245

ttc ggt gaa ctg ccc ctg tcc ctg gcc gcg tgc acc aac cag ctg ggc  1901
Phe Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Asn Gln Leu Gly
        250                 255                 260

atc gtg aag ttc ctg ctg cag aac tcc tgg cag acg gcc gac atc agc  1949
Ile Val Lys Phe Leu Leu Gln Asn Ser Trp Gln Thr Ala Asp Ile Ser
    265                 270                 275

gcc agg gac tcg gtg ggc aac acg gtg ctg cac gcc ctg gtg gag gtg  1997
Ala Arg Asp Ser Val Gly Asn Thr Val Leu His Ala Leu Val Glu Val
280                 285                 290                 295

gcc gac aac acg gcc gac aac acg aag ttt gtg acg agc atg tac aat  2045
Ala Asp Asn Thr Ala Asp Asn Thr Lys Phe Val Thr Ser Met Tyr Asn
                300                 305                 310

gag att ctg atg ctg ggg gcc aaa ctg cac ccg acg ctg aag ctg gag  2093
Glu Ile Leu Met Leu Gly Ala Lys Leu His Pro Thr Leu Lys Leu Glu
            315                 320                 325

gag ctc acc aac aag aag gga atg acg ccg ctg gct ctg gca gct ggg  2141
Glu Leu Thr Asn Lys Lys Gly Met Thr Pro Leu Ala Leu Ala Ala Gly
        330                 335                 340

acc ggg aag atc ggg gtc ttg gcc tat att ctc cag cgg gag atc cag  2189
Thr Gly Lys Ile Gly Val Leu Ala Tyr Ile Leu Gln Arg Glu Ile Gln
    345                 350                 355

gag ccc gag tgc agg cac ctg tcc agg aag ttc acc gag tgg gcc tac  2237
Glu Pro Glu Cys Arg His Leu Ser Arg Lys Phe Thr Glu Trp Ala Tyr
360                 365                 370                 375

ggg ccc gtg cac tcc tcg ctg tac gac ctg tcc tgc atc gac acc tgc  2285
Gly Pro Val His Ser Ser Leu Tyr Asp Leu Ser Cys Ile Asp Thr Cys
                380                 385                 390

gag aag aac tcg gtg ctg gag gtg atc gcc tac agc agc agc gag acc  2333
Glu Lys Asn Ser Val Leu Glu Val Ile Ala Tyr Ser Ser Ser Glu Thr
            395                 400                 405

cct aat cgc cac gac atg ctc ttg gtg gag ccg ctg aac cga ctc ctg  2381
Pro Asn Arg His Asp Met Leu Leu Val Glu Pro Leu Asn Arg Leu Leu
        410                 415                 420

cag gac aag tgg gac aga ttc gtc aag cgc atc ttc tac ttc aac ttc  2429
Gln Asp Lys Trp Asp Arg Phe Val Lys Arg Ile Phe Tyr Phe Asn Phe
    425                 430                 435

ctg gtc tac tgc ctg tac atg atc atc ttc acc atg gct gcc tac tac  2477
Leu Val Tyr Cys Leu Tyr Met Ile Ile Phe Thr Met Ala Ala Tyr Tyr
440                 445                 450                 455

agg ccc gtg gat ggc ttg cct ccc ttt aag atg gaa aaa att gga gac  2525
Arg Pro Val Asp Gly Leu Pro Pro Phe Lys Met Glu Lys Ile Gly Asp
                460                 465                 470

tat ttc cga gtt act gga gag atc ctg tct gtg tta gga gga gtc tac  2573
Tyr Phe Arg Val Thr Gly Glu Ile Leu Ser Val Leu Gly Gly Val Tyr
            475                 480                 485

ttc ttt ttc cga ggg att cag tat ttc ctg cag agg cgg ccg tcg atg  2621
Phe Phe Phe Arg Gly Ile Gln Tyr Phe Leu Gln Arg Arg Pro Ser Met
        490                 495                 500

aag acc ctg ttt gtg gac agc tac agt gag atg ctt ttc ttt ctg cag  2669
Lys Thr Leu Phe Val Asp Ser Tyr Ser Glu Met Leu Phe Phe Leu Gln
    505                 510                 515

tca ctg ttc atg ctg gcc acc gtg gtg ctg tac ttc agc cac ctc aag  2717
Ser Leu Phe Met Leu Ala Thr Val Val Leu Tyr Phe Ser His Leu Lys
520                 525                 530                 535

gag tat gtg gct tcc atg gta ttc tcc ctg gcc ttg ggc tgg acc aac  2765
Glu Tyr Val Ala Ser Met Val Phe Ser Leu Ala Leu Gly Trp Thr Asn
                540                 545                 550

atg ctc tac tac acc cgc ggt ttc cag cag atg ggc atc tat gcc gtc  2813
Met Leu Tyr Tyr Thr Arg Gly Phe Gln Gln Met Gly Ile Tyr Ala Val
            555                 560                 565

atg ata gag aag atg atc ctg aga gac ctg tgc cgt ttc atg ttt gtc  2861
Met Ile Glu Lys Met Ile Leu Arg Asp Leu Cys Arg Phe Met Phe Val
        570                 575                 580

tac atc gtc ttc ttg ttc ggg ttt tcc aca gcg gtg gtg acg ctg att  2909
Tyr Ile Val Phe Leu Phe Gly Phe Ser Thr Ala Val Val Thr Leu Ile
    585                 590                 595

gaa gac ggg aag aat gac tcc ctg ccg tct gag tcc acg tcg cac agg  2957
Glu Asp Gly Lys Asn Asp Ser Leu Pro Ser Glu Ser Thr Ser His Arg
600                 605                 610                 615

tgg cgg ggg cct gcc tgc agg ccc ccc gat agc tcc tac aac agc ctg  3005
Trp Arg Gly Pro Ala Cys Arg Pro Pro Asp Ser Ser Tyr Asn Ser Leu
                620                 625                 630

tac tcc acc tgc ctg gag ctg ttc aag ttc acc atc ggc atg ggc gac  3053
Tyr Ser Thr Cys Leu Glu Leu Phe Lys Phe Thr Ile Gly Met Gly Asp
            635                 640                 645



ctg gag ttc act gag aac tat gac ttc aag gct gtc ttc atc atc ctg  3101
Leu Glu Phe Thr Glu Asn Tyr Asp Phe Lys Ala Val Phe Ile Ile Leu
        650                 655                 660

ctg ctg gcc tat gta att ctc acc tac atc ctc ctg ctc aac atg ctc  3149
Leu Leu Ala Tyr Val Ile Leu Thr Tyr Ile Leu Leu Leu Asn Met Leu
    665                 670                 675

atc gcc ctc atg ggt gag act gtc aac aag atc gca cag gag agc aag  3197
Ile Ala Leu Met Gly Glu Thr Val Asn Lys Ile Ala Gln Glu Ser Lys
680                 685                 690                 695

aac atc tgg aag ctg cag aga gcc atc acc atc ctg gac acg gag aag  3245
Asn Ile Trp Lys Leu Gln Arg Ala Ile Thr Ile Leu Asp Thr Glu Lys
                700                 705                 710

agc ttc ctt aag tgc atg agg aag gcc ttc cgc tca ggc aag ctg ctg  3293
Ser Phe Leu Lys Cys Met Arg Lys Ala Phe Arg Ser Gly Lys Leu Leu
            715                 720                 725

cag gtg ggg tac aca cct gat ggc aag gac gac tac cgg tgg tgc ttc  3341
Gln Val Gly Tyr Thr Pro Asp Gly Lys Asp Asp Tyr Arg Trp Cys Phe
        730                 735                 740

agg gtg gac gag gtg aac tgg acc acc tgg aac acc aac gtg ggc atc  3389
Arg Val Asp Glu Val Asn Trp Thr Thr Trp Asn Thr Asn Val Gly Ile
    745                 750                 755

atc aac gaa gac ccg ggc aac tgt gag ggc gtc aag cgc acc ctg agc  3437
Ile Asn Glu Asp Pro Gly Asn Cys Glu Gly Val Lys Arg Thr Leu Ser
760                 765                 770                 775

ttc tcc ctg cgg tca agc aga gtt tca ggc aga cac tgg aag aac ttt  3485
Phe Ser Leu Arg Ser Ser Arg Val Ser Gly Arg His Trp Lys Asn Phe
                780                 785                 790

gcc ctg gtc ccc ctt tta aga gag gca agt gct cga gat agg cag tct  3533
Ala Leu Val Pro Leu Leu Arg Glu Ala Ser Ala Arg Asp Arg Gln Ser
            795                 800                 805

gct cag ccc gag gaa gtt tat ctg cga cag ttt tca ggg tct ctg aag  3581
Ala Gln Pro Glu Glu Val Tyr Leu Arg Gln Phe Ser Gly Ser Leu Lys
        810                 815                 820

cca gag gac gct gag gtc ttc aag agt cct gcc gct tcc ggg gag aag  3629
Pro Glu Asp Ala Glu Val Phe Lys Ser Pro Ala Ala Ser Gly Glu Lys
    825                 830                 835

tgaggacgtc acgcagacag cactgtcaac actgggcctt aggagacccc gttgccacgg  3689
ggggctgctg agggaacacc agtgctctgt cagcagcctg gcctggtctg tgcctgccca  3749
gcatgttccc aaatctgtgc tggacaagct gtgggaagcg ttcttggaag catggggagt  3809
gatgtacatc caaccgtcac tgtccccaag tgaatctcct aacagacttt caggttttta  3869
ctcactttac taaaaaaaaa aaaaaaaggg cggccgctta  3909
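
As a consistency check on the listing, the coding-sequence coordinates given in the <222> feature of SEQ ID NO:1 can be compared with the lengths declared for SEQ ID NO:2 and SEQ ID NO:3. The short Python sketch below is illustrative only. The variable names and the deliberately tiny codon table are assumptions made for the example, and the first seven codons are written with "agc" where the scanned text is ambiguous between "agc" and "age".

```python
# CDS coordinates from the <222> feature of SEQ ID NO:1 (1-based, inclusive).
CDS_START, CDS_END = 1113, 3629

cds_length = CDS_END - CDS_START + 1   # nucleotides in the coding sequence
codon_count = cds_length // 3          # encoded residues, stop codon not included

# Minimal codon table covering only the codons used in the example below.
CODON_TABLE = {
    "atg": "Met", "aag": "Lys", "aaa": "Lys",
    "tgg": "Trp", "agc": "Ser", "aca": "Thr",
}


def translate(codons: str) -> list:
    """Translate a space-separated codon string using CODON_TABLE."""
    return [CODON_TABLE[codon] for codon in codons.lower().split()]


first_codons = "atg aag aaa tgg agc agc aca"
print(cds_length, codon_count)
print(" ".join(translate(first_codons)))
```

The computed length of 2517 nucleotides agrees with the <211> value of SEQ ID NO:3, and the 839 codons agree with the <211> value of SEQ ID NO:2.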



<210> 2 
<211> 839 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu
1               5                   10                  15

Gln Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg
            20                  25                  30

Pro Pro Pro Ala Lys Pro Gln Leu Pro Thr Ala Lys Ser Arg Thr Arg
        35                  40                  45

Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro
    50                  55                  60

His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr Ile Thr Val Ser Pro
65                  70                  75                  80

Val Ile Thr Ile Gln Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu
                85                  90                  95

Leu Ser Gln Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu
            100                 105                 110

Tyr Asp Arg Arg Ser Ile Phe Glu Ala Val Ala Gln Asn Asn Cys Gln
        115                 120                 125

Asp Leu Glu Ser Leu Leu Leu Phe Leu Gln Lys Ser Lys Lys His Leu
    130                 135                 140

Thr Asp Asn Glu Phe Lys Asp Pro Glu Thr Gly Lys Thr Cys Leu Leu
145                 150                 155                 160

Lys Ala Met Leu Asn Leu His Asp Gly Gln Asn Thr Thr Ile Pro Leu
                165                 170                 175

Leu Leu Glu Ile Ala Arg Gln Thr Asp Ser Leu Lys Glu Leu Val Asn
            180                 185                 190

Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gln Thr Ala Leu His Ile
        195                 200                 205

Ala Ile Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn
    210                 215                 220

Gly Ala Asp Val Gln Ala Ala Ala His Gly Asp Phe Phe Lys Lys Thr
225                 230                 235                 240

Lys Gly Arg Pro Gly Phe Tyr Phe Gly Glu Leu Pro Leu Ser Leu Ala
                245                 250                 255

Ala Cys Thr Asn Gln Leu Gly Ile Val Lys Phe Leu Leu Gln Asn Ser
            260                 265                 270

Trp Gln Thr Ala Asp Ile Ser Ala Arg Asp Ser Val Gly Asn Thr Val
        275                 280                 285

Leu His Ala Leu Val Glu Val Ala Asp Asn Thr Ala Asp Asn Thr Lys
    290                 295                 300

Phe Val Thr Ser Met Tyr Asn Glu Ile Leu Met Leu Gly Ala Lys Leu
305                 310                 315                 320

His Pro Thr Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr
                325                 330                 335

Pro Leu Ala Leu Ala Ala Gly Thr Gly Lys Ile Gly Val Leu Ala Tyr
            340                 345                 350

Ile Leu Gln Arg Glu Ile Gln Glu Pro Glu Cys Arg His Leu Ser Arg
        355                 360                 365

Lys Phe Thr Glu Trp Ala Tyr Gly Pro Val His Ser Ser Leu Tyr Asp
    370                 375                 380

Leu Ser Cys Ile Asp Thr Cys Glu Lys Asn Ser Val Leu Glu Val Ile
385                 390                 395                 400

Ala Tyr Ser Ser Ser Glu Thr Pro Asn Arg His Asp Met Leu Leu Val
                405                 410                 415

Glu Pro Leu Asn Arg Leu Leu Gln Asp Lys Trp Asp Arg Phe Val Lys
            420                 425                 430

Arg Ile Phe Tyr Phe Asn Phe Leu Val Tyr Cys Leu Tyr Met Ile Ile
        435                 440                 445

Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val Asp Gly Leu Pro Pro Phe
    450                 455                 460

Lys Met Glu Lys Ile Gly Asp Tyr Phe Arg Val Thr Gly Glu Ile Leu
465                 470                 475                 480

Ser Val Leu Gly Gly Val Tyr Phe Phe Phe Arg Gly Ile Gln Tyr Phe
                485                 490                 495

Leu Gln Arg Arg Pro Ser Met Lys Thr Leu Phe Val Asp Ser Tyr Ser
            500                 505                 510

Glu Met Leu Phe Phe Leu Gln Ser Leu Phe Met Leu Ala Thr Val Val
        515                 520                 525

Leu Tyr Phe Ser His Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser
    530                 535                 540

Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gln
545                 550                 555                 560

Gln Met Gly Ile Tyr Ala Val Met Ile Glu Lys Met Ile Leu Arg Asp
                565                 570                 575

Leu Cys Arg Phe Met Phe Val Tyr Ile Val Phe Leu Phe Gly Phe Ser
            580                 585                 590

Thr Ala Val Val Thr Leu Ile Glu Asp Gly Lys Asn Asp Ser Leu Pro
        595                 600                 605

Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro
    610                 615                 620

Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys
625                 630                 635                 640

Phe Thr Ile Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe
                645                 650                 655

Lys Ala Val Phe Ile Ile Leu Leu Leu Ala Tyr Val Ile Leu Thr Tyr
            660                 665                 670

Ile Leu Leu Leu Asn Met Leu Ile Ala Leu Met Gly Glu Thr Val Asn
        675                 680                 685

Lys Ile Ala Gln Glu Ser Lys Asn Ile Trp Lys Leu Gln Arg Ala Ile
    690                 695                 700

Thr Ile Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala
705                 710                 715                 720

Phe Arg Ser Gly Lys Leu Leu Gln Val Gly Tyr Thr Pro Asp Gly Lys
                725                 730                 735

Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr
            740                 745                 750

Trp Asn Thr Asn Val Gly Ile Ile Asn Glu Asp Pro Gly Asn Cys Glu
        755                 760                 765

Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser
    770                 775                 780

Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala
785                 790                 795                 800

Ser Ala Arg Asp Arg Gln Ser Ala Gln Pro Glu Glu Val Tyr Leu Arg
                805                 810                 815

Gln Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser
            820                 825                 830

Pro Ala Ala Ser Gly Glu Lys
        835
<210> 3

<211> 2517 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(2517)

<400> 3 

atg aag aaa tgg agc agc aca gac ttg ggg aca gct gcg gac cca ctc  48
Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu
1               5                   10                  15

caa aag gac acc tgc cca gac ccc ctg gat gga gac cct aac tcc agg  96
Gln Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg
            20                  25                  30

cca cct cca gcc aag ccc cag ctc ccc acg gcc aag agc cgc acc cgg  144
Pro Pro Pro Ala Lys Pro Gln Leu Pro Thr Ala Lys Ser Arg Thr Arg
        35                  40                  45

ctc ttt ggg aag ggt gac tcg gag gag gct ttc ccg gtg gat tgc ccc  192
Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro
    50                  55                  60

cac gag gaa ggt gag ttg gac tcc tgc ccg acc atc aca gtc agc cct  240
His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr Ile Thr Val Ser Pro
65                  70                  75                  80

gtt atc acc atc cag agg cca gga gac ggc ccc acc ggt gcc agg ctg  288
Val Ile Thr Ile Gln Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu
                85                  90                  95

ctg tcc cag gac tct gtc gcc gcc agc acc gag aag acc ctc agg ctc  336
Leu Ser Gln Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu
            100                 105                 110

tat gat cgc agg agt atc ttt gaa gcc gtt gct cag aat aac tgc cag  384
Tyr Asp Arg Arg Ser Ile Phe Glu Ala Val Ala Gln Asn Asn Cys Gln
        115                 120                 125

gat ctg gag agc ctg ctg ctc ttc ctg cag aag agc aag aag cac ctc  432
Asp Leu Glu Ser Leu Leu Leu Phe Leu Gln Lys Ser Lys Lys His Leu
    130                 135                 140

aca gac aac gag ttc aaa gac cct gag aca ggg aag acc tgt ctg ctg  480
Thr Asp Asn Glu Phe Lys Asp Pro Glu Thr Gly Lys Thr Cys Leu Leu
145                 150                 155                 160

aaa gcc atg ctc aac ctg cac gac gga cag aac acc acc atc ccc ctg  528
Lys Ala Met Leu Asn Leu His Asp Gly Gln Asn Thr Thr Ile Pro Leu
                165                 170                 175

ctc ctg gag atc gcg cgg caa acg gac agc ctg aag gag ctt gtc aac  576
Leu Leu Glu Ile Ala Arg Gln Thr Asp Ser Leu Lys Glu Leu Val Asn
            180                 185                 190

gcc agc tac acg gac agc tac tac aag ggc cag aca gca ctg cac atc  624
Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gln Thr Ala Leu His Ile
        195                 200                 205

gcc atc gag aga cgc aac atg gcc ctg gtg acc ctc ctg gtg gag aac  672
Ala Ile Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn
    210                 215                 220

gga gca gac gtc cag gct gcg gcc cat ggg gac ttc ttt aag aaa acc  720
Gly Ala Asp Val Gln Ala Ala Ala His Gly Asp Phe Phe Lys Lys Thr
225                 230                 235                 240

aaa ggg cgg cct gga ttc tac ttc ggt gaa ctg ccc ctg tcc ctg gcc  768
Lys Gly Arg Pro Gly Phe Tyr Phe Gly Glu Leu Pro Leu Ser Leu Ala
                245                 250                 255

gcg tgc acc aac cag ctg ggc atc gtg aag ttc ctg ctg cag aac tcc  816
Ala Cys Thr Asn Gln Leu Gly Ile Val Lys Phe Leu Leu Gln Asn Ser
            260                 265                 270

tgg cag acg gcc gac atc agc gcc agg gac tcg gtg ggc aac acg gtg  864
Trp Gln Thr Ala Asp Ile Ser Ala Arg Asp Ser Val Gly Asn Thr Val
        275                 280                 285

ctg cac gcc ctg gtg gag gtg gcc gac aac acg gcc gac aac acg aag  912
Leu His Ala Leu Val Glu Val Ala Asp Asn Thr Ala Asp Asn Thr Lys
    290                 295                 300

ttt gtg acg agc atg tac aat gag att ctg atg ctg ggg gcc aaa ctg  960
Phe Val Thr Ser Met Tyr Asn Glu Ile Leu Met Leu Gly Ala Lys Leu
305                 310                 315                 320

cac ccg acg ctg aag ctg gag gag ctc acc aac aag aag gga atg acg  1008
His Pro Thr Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr
                325                 330                 335

ccg ctg gct ctg gca gct ggg acc ggg aag atc ggg gtc ttg gcc tat  1056
Pro Leu Ala Leu Ala Ala Gly Thr Gly Lys Ile Gly Val Leu Ala Tyr
            340                 345                 350

att ctc cag cgg gag atc cag gag ccc gag tgc agg cac ctg tcc agg  1104
Ile Leu Gln Arg Glu Ile Gln Glu Pro Glu Cys Arg His Leu Ser Arg
        355                 360                 365

aag ttc acc gag tgg gcc tac ggg ccc gtg cac tcc tcg ctg tac gac  1152
Lys Phe Thr Glu Trp Ala Tyr Gly Pro Val His Ser Ser Leu Tyr Asp
    370                 375                 380

ctg tcc tgc atc gac acc tgc gag aag aac tcg gtg ctg gag gtg atc  1200
Leu Ser Cys Ile Asp Thr Cys Glu Lys Asn Ser Val Leu Glu Val Ile
385                 390                 395                 400

gcc tac agc agc agc gag acc cct aat cgc cac gac atg ctc ttg gtg  1248
Ala Tyr Ser Ser Ser Glu Thr Pro Asn Arg His Asp Met Leu Leu Val
                405                 410                 415




gag ccg ctg aac cga ctc ctg cag gac aag tgg gac aga ttc gtc aag 1296
Glu Pro Leu Asn Arg Leu Leu Gln Asp Lys Trp Asp Arg Phe Val Lys
420 425 430

cgc atc ttc tac ttc aac ttc ctg gtc tac tgc ctg tac atg atc atc 1344
Arg Ile Phe Tyr Phe Asn Phe Leu Val Tyr Cys Leu Tyr Met Ile Ile
435 440 445

ttc acc atg gct gcc tac tac agg ccc gtg gat ggc ttg cct ccc ttt 1392
Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val Asp Gly Leu Pro Pro Phe
450 455 460

aag atg gaa aaa att gga gac tat ttc cga gtt act gga gag atc ctg 1440
Lys Met Glu Lys Ile Gly Asp Tyr Phe Arg Val Thr Gly Glu Ile Leu
465 470 475 480

tct gtg tta gga gga gtc tac ttc ttt ttc cga ggg att cag tat ttc 1488
Ser Val Leu Gly Gly Val Tyr Phe Phe Phe Arg Gly Ile Gln Tyr Phe
485 490 495

ctg cag agg cgg ccg tcg atg aag acc ctg ttt gtg gac agc tac agt 1536
Leu Gln Arg Arg Pro Ser Met Lys Thr Leu Phe Val Asp Ser Tyr Ser
500 505 510

gag atg ctt ttc ttt ctg cag tca ctg ttc atg ctg gcc acc gtg gtg 1584
Glu Met Leu Phe Phe Leu Gln Ser Leu Phe Met Leu Ala Thr Val Val
515 520 525

ctg tac ttc agc cac ctc aag gag tat gtg gct tcc atg gta ttc tcc 1632
Leu Tyr Phe Ser His Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser
530 535 540

ctg gcc ttg ggc tgg acc aac atg ctc tac tac acc cgc ggt ttc cag 1680
Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gln
545 550 555 560

cag atg ggc atc tat gcc gtc atg ata gag aag atg atc ctg aga gac 1728
Gln Met Gly Ile Tyr Ala Val Met Ile Glu Lys Met Ile Leu Arg Asp
565 570 575

ctg tgc cgt ttc atg ttt gtc tac atc gtc ttc ttg ttc ggg ttt tcc 1776
Leu Cys Arg Phe Met Phe Val Tyr Ile Val Phe Leu Phe Gly Phe Ser
580 585 590

aca gcg gtg gtg acg ctg att gaa gac ggg aag aat gac tcc ctg ccg 1824
Thr Ala Val Val Thr Leu Ile Glu Asp Gly Lys Asn Asp Ser Leu Pro
595 600 605

tct gag tcc acg tcg cac agg tgg cgg ggg cct gcc tgc agg ccc ccc 1872
Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro
610 615 620

gat agc tcc tac aac agc ctg tac tcc acc tgc ctg gag ctg ttc aag 1920
Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys
625 630 635 640

ttc acc atc ggc atg ggc gac ctg gag ttc act gag aac tat gac ttc 1968
Phe Thr Ile Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe
645 650 655

aag gct gtc ttc atc atc ctg ctg ctg gcc tat gta att ctc acc tac 2016
Lys Ala Val Phe Ile Ile Leu Leu Leu Ala Tyr Val Ile Leu Thr Tyr
660 665 670

atc ctc ctg ctc aac atg ctc atc gcc ctc atg ggt gag act gtc aac 2064
Ile Leu Leu Leu Asn Met Leu Ile Ala Leu Met Gly Glu Thr Val Asn
675 680 685

aag atc gca cag gag agc aag aac atc tgg aag ctg cag aga gcc atc 2112
Lys Ile Ala Gln Glu Ser Lys Asn Ile Trp Lys Leu Gln Arg Ala Ile
690 695 700

acc atc ctg gac acg gag aag agc ttc ctt aag tgc atg agg aag gcc 2160
Thr Ile Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala
705 710 715 720

ttc cgc tca ggc aag ctg ctg cag gtg ggg tac aca cct gat ggc aag 2208
Phe Arg Ser Gly Lys Leu Leu Gln Val Gly Tyr Thr Pro Asp Gly Lys
725 730 735

gac gac tac cgg tgg tgc ttc agg gtg gac gag gtg aac tgg acc acc 2256
Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr
740 745 750

tgg aac acc aac gtg ggc atc atc aac gaa gac ccg ggc aac tgt gag 2304
Trp Asn Thr Asn Val Gly Ile Ile Asn Glu Asp Pro Gly Asn Cys Glu
755 760 765

ggc gtc aag cgc acc ctg agc ttc tcc ctg cgg tca agc aga gtt tca 2352
Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser
770 775 780

ggc aga cac tgg aag aac ttt gcc ctg gtc ccc ctt tta aga gag gca 2400
Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala
785 790 795 800

agt gct cga gat agg cag tct gct cag ccc gag gaa gtt tat ctg cga 2448
Ser Ala Arg Asp Arg Gln Ser Ala Gln Pro Glu Glu Val Tyr Leu Arg
805 810 815

cag ttt tca ggg tct ctg aag cca gag gac gct gag gtc ttc aag agt 2496
Gln Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser
820 825 830

cct gcc gct tcc ggg gag aag 2517
Pro Ala Ala Ser Gly Glu Lys
835

<210> 4
<211> 2809
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (361)..(2652)

<400> 4
ggctagcctg tcctgacagg ggagagttaa gctcccgttc tccaccgtgc cggctggcca 60
ggtgggctga gggtgaccga gagaccagaa cctgcttgct ggagcttagt gctcagagct 120
ggggagggag gttccgccgc tcctctgctg tcagcgccgg cagcccctcc cggcttcact 180
tcctcccgca gcccctgcta ctgagaagct ccgggatccc agcagccgcc acgccctggc 240
ctcagcctgc ggggctccag tcaggccaac accgacgcgc agctgggagg aagacaggac 300
ccttgacatc tccatctgca cagaggtcct ggctggaccg agcagcctcc tcctcctagg 360



atg acc tca ccc tcc agc tct cca gtt ttc agg ttg gag aca tta gat 408
Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp
1 5 10 15

gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 456
Gly Gly Gln Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe
20 25 30

ggg agc ggg ctg cct ccc atg gag tca cag ttc cag ggc gag gac cgg 504
Gly Ser Gly Leu Pro Pro Met Glu Ser Gln Phe Gln Gly Glu Asp Arg
35 40 45

aaa ttc gcc cct cag ata aga gtc aac ctc aac tac cga aag gga aca 552
Lys Phe Ala Pro Gln Ile Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr
50 55 60

ggt gcc agt cag ccg gat cca aac cga ttt gac cga gat cgg ctc ttc 600
Gly Ala Ser Gln Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe
65 70 75 80

aat gcg gtc tcc cgg ggt gtc ccc gag gat ctg gct gga ctt cca gag 648
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu
85 90 95

tac ctg agc aag acc agc aag tac ctc acc gac tcg gaa tac aca gag 696
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu
100 105 110

ggc tcc aca ggt aag acg tgc ctg atg aag gct gtg ctg aac ctt aag 744
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys
115 120 125

gac gga gtc aat gcc tgc att ctg cca ctg ctg cag atc gac agg gac 792
Asp Gly Val Asn Ala Cys Ile Leu Pro Leu Leu Gln Ile Asp Arg Asp
130 135 140

tct ggc aat cct cag ccc ctg gta aat gcc cag tgc aca gat gac tat 840
Ser Gly Asn Pro Gln Pro Leu Val Asn Ala Gln Cys Thr Asp Asp Tyr
145 150 155 160

tac cga ggc cac agc gct ctg cac atc gcc att gag aag agg agt ctg 888
Tyr Arg Gly His Ser Ala Leu His Ile Ala Ile Glu Lys Arg Ser Leu
165 170 175

cag tgt gtg aag ctc ctg gtg gag aat ggg gcc aat gtg cat gcc cgg 936
Gln Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg
180 185 190

gcc tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 984
Ala Cys Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe
195 200 205

ggt gag cta ccc ctc tct ttg gcc gct tgc acc aag cag tgg gat gtg 1032
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val
210 215 220

gta agc tac ctc ctg gag aac cca cac cag ccc gcc agc ctg cag gcc 1080
Val Ser Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala
225 230 235 240

act gac tcc cag ggc aac aca gtc ctg cat gcc cta gtg atg atc tcg 1128
Thr Asp Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser
245 250 255

gac aac tca gct gag aac att gca ctg gtg acc agc atg tat gat ggg 1176
Asp Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly
260 265 270

ctc ctc caa gct ggg gcc cgc ctc tgc cct acc gtg cag ctt gag gac 1224
Leu Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp
275 280 285

atc cgc aac ctg cag gat ctc acg cct ctg aag ctg gcc gcc aag gag 1272
Ile Arg Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu
290 295 300

ggc aag atc gag att ttc agg cac atc ctg cag cgg gag ttt tca gga 1320
Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly
305 310 315 320

ctg agc cac ctt tcc cga aag ttc acc gag tgg tgc tat ggg cct gtc 1368
Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val
325 330 335

cgg gtg tcg ctg tat gac ctg gct tct gtg gac agc tgt gag gag aac 1416
Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn
340 345 350

tca gtg ctg gag atc att gcc ttt cat tgc aag agc ccg cac cga cac 1464
Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His
355 360 365

cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 1512
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp
370 375 380

gat ctg ctc atc ccc aag ttc ttc tta aac ttc ctg tgt aat ctg atc 1560
Asp Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile
385 390 395 400

tac atg ttc atc ttc acc gct gtt gcc tac cat cag cct acc ctg aag 1608
Tyr Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys
405 410 415

aag cag gcc gcc cct cac ctg aaa gcg gag gtt gga aac tcc atg ctg 1656
Lys Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu
420 425 430

ctg acg ggc cac atc ctt atc ctg cta ggg ggg atc tac ctc ctc gtg 1704
Leu Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val
435 440 445

ggc cag ctg tgg tac ttc tgg cgg cgc cac gtg ttc atc tgg atc tcg 1752
Gly Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser
450 455 460

ttc ata gac agc tac ttt gaa atc ctc ttc ctg ttc cag gcc ctg ctc 1800
Phe Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu
465 470 475 480



aca gtg gtg tcc cag gtg ctg tgt ttc ctg gcc atc gag tgg tac ctg 1848
Thr Val Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu
485 490 495

ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 1896
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr
500 505 510

tat aca cgt ggc ttc cag cac aca ggc atc tac agt gtc atg atc cag 1944
Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln
515 520 525

aag gtc atc ctg cgg gac ctg ctg cgc ttc ctt ctg atc tac tta gtc 1992
Lys Val Ile Leu Arg Asp Leu Leu Arg Phe Leu Leu Ile Tyr Leu Val
530 535 540

ttc ctt ttc ggc ttc gct gta gcc ctg gtg agc ctg agc cag gag gct 2040
Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gln Glu Ala
545 550 555 560



tgg cgc ccc gaa gct cct aca ggc ccc aat gcc aca gag tca gtg cag 2088
Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gln
565 570 575

ccc atg gag gga cag gag gac gag ggc aac ggg gcc cag tac agg ggt 2136
Pro Met Glu Gly Gln Glu Asp Glu Gly Asn Gly Ala Gln Tyr Arg Gly
580 585 590

atc ctg gaa gcc tcc ttg gag ctc ttc aaa ttc acc atc ggc atg ggc 2184
Ile Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr Ile Gly Met Gly
595 600 605

gag ctg gcc ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 2232
Glu Leu Ala Phe Gln Glu Gln Leu His Phe Arg Gly Met Val Leu Leu
610 615 620

ctg ctg ctg gcc tac gtg ctg ctc acc tac atc ctg ctg ctc aac atg 2280
Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Ile Leu Leu Leu Asn Met
625 630 635 640

ctc atc gcc ctc atg agc gag acc gtc aac agt gtc gcc act gac agc 2328
Leu Ile Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser
645 650 655

tgg agc atc tgg aag ctg cag aaa gcc atc tct gtc ctg gag atg gag 2376
Trp Ser Ile Trp Lys Leu Gln Lys Ala Ile Ser Val Leu Glu Met Glu
660 665 670

aat ggc tat tgg tgg tgc agg aag aag cag cgg gca ggt gtg atg ctg 2424
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gln Arg Ala Gly Val Met Leu
675 680 685

acc gtt ggc act aag cca gat ggc agc ccg gat gag cgc tgg tgc ttc 2472
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe
690 695 700

agg gtg gag gag gtg aac tgg gct tca tgg gag cag acg ctg cct acg 2520
Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gln Thr Leu Pro Thr
705 710 715 720

ctg tgt gag gac ccg tca ggg gca ggt gtc cct cga act ctc gag aac 2568
Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn
725 730 735

cct gtc ctg gct tcc cct ccc aag gag gat gag gat ggt gcc tct gag 2616
Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu
740 745 750

gaa aac tat gtg ccc gtc cag ctc ctc cag tcc aac tgatggccca 2662
Glu Asn Tyr Val Pro Val Gln Leu Leu Gln Ser Asn
755 760

gatgcagcag gaggccagag gacagagcag aggatctttc caaccacatc tgctggctct 2722
ggggtcccag tgaattctgg tggcaaatat atattttcac taactcaaaa aaaaaaaaaa 2782
aaaaaaaaaa aaaaaaaaaa aaaaaaa 2809 
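
The CDS coordinates given in the <222> fields can be cross-checked against the protein lengths given in the <211> fields. The following is a minimal sketch in Python and is not part of the original listing. The function name `cds_codon_count` is hypothetical, and the sketch assumes that each CDS range is 1-based, inclusive, and excludes the stop codon (as SEQ ID NO:4 suggests, where "tga" directly follows position 2652).

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a CDS with 1-based inclusive coordinates."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS {start}..{end} is not a whole number of codons")
    return length // 3


# Coordinates copied from the <222> fields of this listing.
print(cds_codon_count(361, 2652))  # SEQ ID NO:4 -> 764, the <211> of SEQ ID NO:5
print(cds_codon_count(3, 1310))    # SEQ ID NO:7 -> 436, the <211> of SEQ ID NO:8
```

Under these assumptions, a mismatch between the codon count and the stated protein length would point to a transcription error in either field.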



<210> 5 
<211> 764 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp
1 5 10 15

Gly Gly Gln Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe
20 25 30

Gly Ser Gly Leu Pro Pro Met Glu Ser Gln Phe Gln Gly Glu Asp Arg
35 40 45

Lys Phe Ala Pro Gln Ile Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr
50 55 60

Gly Ala Ser Gln Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe
65 70 75 80

Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu
85 90 95

Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu
100 105 110

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys
115 120 125

Asp Gly Val Asn Ala Cys Ile Leu Pro Leu Leu Gln Ile Asp Arg Asp
130 135 140

Ser Gly Asn Pro Gln Pro Leu Val Asn Ala Gln Cys Thr Asp Asp Tyr
145 150 155 160

Tyr Arg Gly His Ser Ala Leu His Ile Ala Ile Glu Lys Arg Ser Leu
165 170 175

Gln Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg
180 185 190

Ala Cys Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe
195 200 205

Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val
210 215 220

Val Ser Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala
225 230 235 240

Thr Asp Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser
245 250 255

Asp Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly
260 265 270

Leu Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp
275 280 285

Ile Arg Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu
290 295 300

Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly
305 310 315 320

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val
325 330 335

Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn
340 345 350

Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His
355 360 365

Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp
370 375 380

Asp Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile
385 390 395 400

Tyr Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys
405 410 415

Lys Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu
420 425 430

Leu Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val
435 440 445

Gly Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser
450 455 460

Phe Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu
465 470 475 480

Thr Val Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu
485 490 495

Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr
500 505 510

Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln
515 520 525

Lys Val Ile Leu Arg Asp Leu Leu Arg Phe Leu Leu Ile Tyr Leu Val
530 535 540

Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gln Glu Ala
545 550 555 560

Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gln
565 570 575

Pro Met Glu Gly Gln Glu Asp Glu Gly Asn Gly Ala Gln Tyr Arg Gly
580 585 590

Ile Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr Ile Gly Met Gly
595 600 605

Glu Leu Ala Phe Gln Glu Gln Leu His Phe Arg Gly Met Val Leu Leu
610 615 620

Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Ile Leu Leu Leu Asn Met
625 630 635 640

Leu Ile Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser
645 650 655

Trp Ser Ile Trp Lys Leu Gln Lys Ala Ile Ser Val Leu Glu Met Glu
660 665 670

Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gln Arg Ala Gly Val Met Leu
675 680 685

Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe
690 695 700

Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gln Thr Leu Pro Thr
705 710 715 720

Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn
725 730 735

Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu
740 745 750

Glu Asn Tyr Val Pro Val Gln Leu Leu Gln Ser Asn
755 760

<210> 6
<211> 2292
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(2292)

<400> 6

atg acc tca ccc tcc agc tct cca gtt ttc agg ttg gag aca tta gat 48
Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp
1 5 10 15

gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 96
Gly Gly Gln Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe
20 25 30

ggg agc ggg ctg cct ccc atg gag tca cag ttc cag ggc gag gac cgg 144
Gly Ser Gly Leu Pro Pro Met Glu Ser Gln Phe Gln Gly Glu Asp Arg
35 40 45

aaa ttc gcc cct cag ata aga gtc aac ctc aac tac cga aag gga aca 192
Lys Phe Ala Pro Gln Ile Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr
50 55 60

ggt gcc agt cag ccg gat cca aac cga ttt gac cga gat cgg ctc ttc 240
Gly Ala Ser Gln Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe
65 70 75 80

aat gcg gtc tcc cgg ggt gtc ccc gag gat ctg gct gga ctt cca gag 288
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu
85 90 95

tac ctg agc aag acc agc aag tac ctc acc gac tcg gaa tac aca gag 336
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu
100 105 110

ggc tcc aca ggt aag acg tgc ctg atg aag gct gtg ctg aac ctt aag 384
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys
115 120 125

gac gga gtc aat gcc tgc att ctg cca ctg ctg cag atc gac agg gac 432
Asp Gly Val Asn Ala Cys Ile Leu Pro Leu Leu Gln Ile Asp Arg Asp
130 135 140

tct ggc aat cct cag ccc ctg gta aat gcc cag tgc aca gat gac tat 480
Ser Gly Asn Pro Gln Pro Leu Val Asn Ala Gln Cys Thr Asp Asp Tyr
145 150 155 160

tac cga ggc cac agc gct ctg cac atc gcc att gag aag agg agt ctg 528
Tyr Arg Gly His Ser Ala Leu His Ile Ala Ile Glu Lys Arg Ser Leu
165 170 175



cag tgt gtg aag ctc ctg gtg gag aat ggg gcc aat gtg cat gcc cgg 576
Gln Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg
180 185 190

gcc tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 624
Ala Cys Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe
195 200 205

ggt gag cta ccc ctc tct ttg gcc gct tgc acc aag cag tgg gat gtg 672
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val
210 215 220

gta agc tac ctc ctg gag aac cca cac cag ccc gcc agc ctg cag gcc 720
Val Ser Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala
225 230 235 240

act gac tcc cag ggc aac aca gtc ctg cat gcc cta gtg atg atc tcg 768
Thr Asp Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser
245 250 255

gac aac tca gct gag aac att gca ctg gtg acc agc atg tat gat ggg 816
Asp Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly
260 265 270

ctc ctc caa gct ggg gcc cgc ctc tgc cct acc gtg cag ctt gag gac 864
Leu Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp
275 280 285

atc cgc aac ctg cag gat ctc acg cct ctg aag ctg gcc gcc aag gag 912
Ile Arg Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu
290 295 300

ggc aag atc gag att ttc agg cac atc ctg cag cgg gag ttt tca gga 960
Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly
305 310 315 320

ctg agc cac ctt tcc cga aag ttc acc gag tgg tgc tat ggg cct gtc 1008
Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val
325 330 335

cgg gtg tcg ctg tat gac ctg gct tct gtg gac agc tgt gag gag aac 1056
Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn
340 345 350

tca gtg ctg gag atc att gcc ttt cat tgc aag agc ccg cac cga cac 1104
Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His
355 360 365

cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 1152
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp
370 375 380

gat ctg ctc atc ccc aag ttc ttc tta aac ttc ctg tgt aat ctg atc 1200
Asp Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile
385 390 395 400

tac atg ttc atc ttc acc gct gtt gcc tac cat cag cct acc ctg aag 1248
Tyr Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys
405 410 415

aag cag gcc gcc cct cac ctg aaa gcg gag gtt gga aac tcc atg ctg 1296
Lys Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu
420 425 430

ctg acg ggc cac atc ctt atc ctg cta ggg ggg atc tac ctc ctc gtg 1344
Leu Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val
435 440 445

ggc cag ctg tgg tac ttc tgg cgg cgc cac gtg ttc atc tgg atc tcg 1392
Gly Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser
450 455 460

ttc ata gac agc tac ttt gaa atc ctc ttc ctg ttc cag gcc ctg ctc 1440
Phe Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu
465 470 475 480

aca gtg gtg tcc cag gtg ctg tgt ttc ctg gcc atc gag tgg tac ctg 1488
Thr Val Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu
485 490 495

ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 1536
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr
500 505 510

tat aca cgt ggc ttc cag cac aca ggc atc tac agt gtc atg atc cag 1584
Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln
515 520 525

aag gtc atc ctg cgg gac ctg ctg cgc ttc ctt ctg atc tac tta gtc 1632
Lys Val Ile Leu Arg Asp Leu Leu Arg Phe Leu Leu Ile Tyr Leu Val
530 535 540

ttc ctt ttc ggc ttc gct gta gcc ctg gtg agc ctg agc cag gag gct 1680
Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gln Glu Ala
545 550 555 560

tgg cgc ccc gaa gct cct aca ggc ccc aat gcc aca gag tca gtg cag 1728
Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gln
565 570 575



ccc atg gag gga cag gag gac gag ggc aac ggg gcc cag tac agg ggt 1776
Pro Met Glu Gly Gln Glu Asp Glu Gly Asn Gly Ala Gln Tyr Arg Gly
580 585 590

atc ctg gaa gcc tcc ttg gag ctc ttc aaa ttc acc atc ggc atg ggc 1824
Ile Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr Ile Gly Met Gly
595 600 605

gag ctg gcc ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 1872
Glu Leu Ala Phe Gln Glu Gln Leu His Phe Arg Gly Met Val Leu Leu
610 615 620

ctg ctg ctg gcc tac gtg ctg ctc acc tac atc ctg ctg ctc aac atg 1920
Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Ile Leu Leu Leu Asn Met
625 630 635 640



ctc atc gcc ctc atg agc gag acc gtc aac agt gtc gcc act gac agc 1968
Leu Ile Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser
645 650 655

tgg agc atc tgg aag ctg cag aaa gcc atc tct gtc ctg gag atg gag 2016
Trp Ser Ile Trp Lys Leu Gln Lys Ala Ile Ser Val Leu Glu Met Glu
660 665 670

aat ggc tat tgg tgg tgc agg aag aag cag cgg gca ggt gtg atg ctg 2064
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gln Arg Ala Gly Val Met Leu
675 680 685

acc gtt ggc act aag cca gat ggc agc ccg gat gag cgc tgg tgc ttc 2112
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe
690 695 700



agg gtg gag gag gtg aac tgg gct tca tgg gag cag acg ctg cct acg 2160
Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gln Thr Leu Pro Thr
705 710 715 720

ctg tgt gag gac ccg tca ggg gca ggt gtc cct cga act ctc gag aac 2208
Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn
725 730 735

cct gtc ctg gct tcc cct ccc aag gag gat gag gat ggt gcc tct gag 2256
Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu
740 745 750

gaa aac tat gtg ccc gtc cag ctc ctc cag tcc aac 2292
Glu Asn Tyr Val Pro Val Gln Leu Leu Gln Ser Asn
755 760

<210> 7
<211> 1489
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (3)..(1310)

<400> 7
gc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt 47
Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe Gly
1 5 10 15

gag cta ccc ctc tct ttg gcc gct tgc acc aag cag tgg gat gtg gta 95
Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val Val
20 25 30

agc tac ctc ctg gag aac cca cac cag ccc gcc agc ctg cag gcc act 143
Ser Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala Thr
35 40 45

gac tcc cag ggc aac aca gtc ctg cat gcc cta gtg atg atc tcg gac 191
Asp Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser Asp
50 55 60

aac tca gct gag aac att gca ctg gtg acc agc atg tat gat ggg ctc 239
Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly Leu
65 70 75

ctc caa gct ggg gcc cgc ctc tgc cct acc gtg cag ctt gag gac atc 287
Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp Ile
80 85 90 95

cgc aac ctg cag gat ctc acg cct ctg aag ctg gcc gcc aag gag ggc 335
Arg Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly
100 105 110

aag atc gag att ttc agg cac atc ctg cag cgg gag ttt tca gga ctg 383
Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly Leu
115 120 125

agc cac ctt tcc cga aag ttc acc gag tgg tgc tat ggg cct gtc cgg 431
Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg
130 135 140

gtg tcg ctg tat gac ctg gct tct gtg gac agc tgt gag gag aac tca 479
Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser
145 150 155

gtg ctg gag atc att gcc ttt cat tgc aag agc ccg cac cga cac cga 527
Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His Arg
160 165 170 175

atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat 575
Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp Asp
180 185 190

ctg ctc atc ccc aag ttc ttc tta aac ttc ctg tgt aat ctg atc tac 623
Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile Tyr
195 200 205

atg ttc atc ttc acc gct gtt gcc tac cat cag cct acc ctg aag aag 671
Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys Lys
210 215 220

cag gcc gcc cct cac ctg aaa gcg gag gtt gga aac tcc atg ctg ctg 719
Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu
225 230 235

acg ggc cac atc ctt atc ctg cta ggg ggg atc tac ctc ctc gtg ggc 767
Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val Gly
240 245 250 255

cag ctg tgg tac ttc tgg cgg cgc cac gtg ttc atc tgg atc tcg ttc 815
Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser Phe
260 265 270

ata gac agc tac ttt gaa atc ctc ttc ctg ttc cag gcc ctg ctc aca 863
Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu Thr
275 280 285

gtg gtg tcc cag gtg ctg tgt ttc ctg gcc atc gag tgg tac ctg ccc 911
Val Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu Pro
290 295 300

ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac tat 959
Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr
305 310 315

aca cgt ggc ttc cag cac aca ggc atc tac agt gtc atg atc cag aag 1007
Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln Lys
320 325 330 335

aaa gcc atc tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg 1055
Lys Ala Ile Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg
340 345 350

aag aag cag cgg gca ggt gtg atg ctg acc gtt ggc act aag cca gat 1103
Lys Lys Gln Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp
355 360 365

ggc agc ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtg aac tgg 1151
Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp
370 375 380

gct tca tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tca ggg 1199
Ala Ser Trp Glu Gln Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly
385 390 395

gca ggt gtc cct cga act ctc gag aac cct gtc ctg gct tcc cct ccc 1247
Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro
400 405 410 415

aag gag gat gag gat ggt gcc tct gag gaa aac tat gtg ccc gtc cag 1295
Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gln
420 425 430

ctc ctc cag tcc aac tgatggccca gatgcagcag gaggccagag gacagagcag 1350
Leu Leu Gln Ser Asn
435

aggatctttc caaccacatc tgctggctct ggggtcccag tgaattctgg tggcaaatat 1410
atattttcac taactcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg 1470
agcggacgcg tgggtcgac 1489



<210> 8 
<211> 436 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe Gly Glu
1 5 10 15

Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val Val Ser
20 25 30

Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala Thr Asp
35 40 45

Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser Asp Asn
50 55 60

Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu
65 70 75 80

Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp Ile Arg
85 90 95

Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly Lys
100 105 110

Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly Leu Ser
115 120 125

His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg Val
130 135 140

Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser Val
145 150 155 160

Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His Arg Met
165 170 175

Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp Asp Leu
180 185 190

Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile Tyr Met
195 200 205

Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys Lys Gln
210 215 220

Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr
225 230 235 240

Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val Gly Gln
245 250 255

Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser Phe Ile
260 265 270

Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu Thr Val
275 280 285

Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu Pro Leu
290 295 300

Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr Thr
305 310 315 320

Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln Lys Lys
325 330 335

Ala Ile Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys
340 345 350

Lys Gln Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp Gly
355 360 365

Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala
370 375 380

Ser Trp Glu Gln Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly Ala
385 390 395 400

Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys
405 410 415

Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gln Leu
420 425 430



Leu Gin Ser Asn 
435 



<210> 9 

<211> 1308 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(1308) 

<400> 9 

ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt gag 48
Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe Gly Glu
1 5 10 15

cta ccc ctc tct ttg gcc gct tgc acc aag cag tgg gat gtg gta agc 96
Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val Val Ser
20 25 30

tac ctc ctg gag aac cca cac cag ccc gcc agc ctg cag gcc act gac 144
Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala Thr Asp
35 40 45

tcc cag ggc aac aca gtc ctg cat gcc cta gtg atg atc tcg gac aac 192
Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser Asp Asn
50 55 60

tca gct gag aac att gca ctg gtg acc agc atg tat gat ggg ctc ctc 240
Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu
65 70 75 80

caa gct ggg gcc cgc ctc tgc cct acc gtg cag ctt gag gac atc cgc 288
Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp Ile Arg
85 90 95

aac ctg cag gat ctc acg cct ctg aag ctg gcc gcc aag gag ggc aag 336
Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly Lys
100 105 110

atc gag att ttc agg cac atc ctg cag cgg gag ttt tca gga ctg agc 384
Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly Leu Ser
115 120 125

cac ctt tcc cga aag ttc acc gag tgg tgc tat ggg cct gtc cgg gtg 432
His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg Val
130 135 140

tcg ctg tat gac ctg gct tct gtg gac agc tgt gag gag aac tca gtg 480
Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser Val
145 150 155 160

ctg gag atc att gcc ttt cat tgc aag agc ccg cac cga cac cga atg 528
Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His Arg Met
165 170 175

gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat ctg 576
Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp Asp Leu
180 185 190

ctc atc ccc aag ttc ttc tta aac ttc ctg tgt aat ctg atc tac atg 624
Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile Tyr Met
195 200 205

ttc atc ttc acc gct gtt gcc tac cat cag cct acc ctg aag aag cag 672
Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys Lys Gln
210 215 220

gcc gcc cct cac ctg aaa gcg gag gtt gga aac tcc atg ctg ctg acg 720
Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr
225 230 235 240

ggc cac atc ctt atc ctg cta ggg ggg atc tac ctc ctc gtg ggc cag 768
Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val Gly Gln
245 250 255



ctg tgg tac ttc tgg cgg cgc cac gtg ttc atc tgg atc tcg ttc ata 816
Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser Phe Ile
260 265 270

gac agc tac ttt gaa atc ctc ttc ctg ttc cag gcc ctg ctc aca gtg 864
Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu Thr Val
275 280 285

gtg tcc cag gtg ctg tgt ttc ctg gcc atc gag tgg tac ctg ccc ctg 912
Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu Pro Leu
290 295 300

ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac tat aca 960
Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr Thr
305 310 315 320

cgt ggc ttc cag cac aca ggc atc tac agt gtc atg atc cag aag aaa 1008
Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln Lys Lys
325 330 335

gcc atc tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg aag 1056
Ala Ile Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys
340 345 350

aag cag cgg gca ggt gtg atg ctg acc gtt ggc act aag cca gat ggc 1104
Lys Gln Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp Gly
355 360 365

agc ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtg aac tgg gct 1152
Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala
370 375 380

tca tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tca ggg gca 1200
Ser Trp Glu Gln Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly Ala
385 390 395 400

ggt gtc cct cga act ctc gag aac cct gtc ctg gct tcc cct ccc aag 1248
Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys
405 410 415

gag gat gag gat ggt gcc tct gag gaa aac tat gtg ccc gtc cag ctc 1296
Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gln Leu
420 425 430

ctc cag tcc aac 1308
Leu Gln Ser Asn
435



<210> 10 

<211> 1794 

<212> DNA 

<213> Rattus sp. 
<220>

<221> CDS

<222> (2)..(1663)

<400> 10 

g tcg acc cac gcg tcc gct ctt tct ctg gct gcg tgc acc aag cag tgg 49
Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp
1 5 10 15

gat gtg gtg acc tac ctc ctg gag aac cca cac cag ccg gcc agc ctg 97
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu
20 25 30

gag gcc acc gac tcc ctg ggc aac aca gtc ctg cat gct ctg gta atg 145
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met
35 40 45

att gca gat aac tcg cct gag aac agt gcc ctg gtg atc cac atg tac 193
Ile Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val Ile His Met Tyr
50 55 60

gac ggg ctt cta caa atg ggg gcg cgc ctc tgc ccc act gtg cag ctt 241
Asp Gly Leu Leu Gln Met Gly Ala Arg Leu Cys Pro Thr Val Gln Leu
65 70 75 80

gag gaa atc tcc aac cac caa ggc ctc aca ccc ctg aaa cta gcc gcc 289
Glu Glu Ile Ser Asn His Gln Gly Leu Thr Pro Leu Lys Leu Ala Ala
85 90 95

aag gaa ggc aaa atc gag att ttc agg cac att ctg cag cgg gaa ttc 337
Lys Glu Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe
100 105 110

tca gga ccg tac cag ccc ctt tcc cga aag ttt act gag tgg tgt tac 385
Ser Gly Pro Tyr Gln Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr
115 120 125

ggt cct gtg cgg gta tcg ctg tac gac ctg tcc tct gtg gac agc tgg 433
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp
130 135 140

gaa aag aac tcg gtg ctg gag atc atc gct ttt cat tgc aag agc ccg 481
Glu Lys Asn Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro
145 150 155 160

aac cgg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 529
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln
165 170 175

gag aaa tgg gat cgg ctc gtc tca aga ttc ttc ttc aac ttc gcc tgc 577
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys
180 185 190

tac ttg gtc tac atg ttc atc ttc acc gtc gtt gcc tac cac cag cct 625
Tyr Leu Val Tyr Met Phe Ile Phe Thr Val Val Ala Tyr His Gln Pro
195 200 205

tcc ctg gat cag cca gcc atc ccc tca tca aaa gcg act ttt ggg gaa 673
Ser Leu Asp Gln Pro Ala Ile Pro Ser Ser Lys Ala Thr Phe Gly Glu
210 215 220

tcc atg ctg ctg ctg ggc cac att ctg atc ctg ctt ggg ggt att tac 721
Ser Met Leu Leu Leu Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr
225 230 235 240

ctc tta ctg ggc cag ctg tgg tac ttt tgg cgg cgg cgc ctg ttt atc 769
Leu Leu Leu Gly Gln Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe Ile
245 250 255

tgg atc tca ttc atg gac agc tac ttt gaa atc ctc ttt ctc ctt cag 817
Trp Ile Ser Phe Met Asp Ser Tyr Phe Glu Ile Leu Phe Leu Leu Gln
260 265 270

gct ctg ctc aca gtg ctg tcc cag gtg ctg cgc ttc atg gag act gaa 865
Ala Leu Leu Thr Val Leu Ser Gln Val Leu Arg Phe Met Glu Thr Glu
275 280 285

tgg tac cta ccc ctg cta gtg tta tcc cta gtg ctg ggc tgg ctg aac 913
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn
290 295 300

ctg ctt tac tac aca cgg ggc ttt cag cac aca ggc atc tac agt gtc 961
Leu Leu Tyr Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val
305 310 315 320

atg atc cag aag gtc atc ctt cga gac ctg ctc cgt ttc ctg ctg gtc 1009
Met Ile Gln Lys Val Ile Leu Arg Asp Leu Leu Arg Phe Leu Leu Val
325 330 335

tac ctg gtc ttc ctt ttc ggc ttt gct gta gcc cta gta agc ttg agc 1057
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser
340 345 350

aga gag gcc cga agt ccc aaa gcc cct gaa gat aac aac tcc aca gtg 1105
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val
355 360 365

acg gaa cag ccc acg gtg ggc cag gag gag gag cca gct cca tat cgg 1153
Thr Glu Gln Pro Thr Val Gly Gln Glu Glu Glu Pro Ala Pro Tyr Arg
370 375 380

agc att ctg gat gcc tcc cta gag ctg ttc aag ttc acc att ggt atg 1201
Ser Ile Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr Ile Gly Met
385 390 395 400

ggg gag ctg gct ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 1249
Gly Glu Leu Ala Phe Gln Glu Gln Leu Arg Phe Arg Gly Val Val Leu
405 410 415

ctg ttg ctg ttg gcc tac gtc ctt ctc acc tac gtc ctg ctg ctc aac 1297
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn
420 425 430

atg ctc att gct ctc atg agc gaa act gtc aac cac gtt gct gac aac 1345
Met Leu Ile Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn
435 440 445

agc tgg agc atc tgg aag ttg cag aaa gcc atc tct gtc ttg gag atg 1393
Ser Trp Ser Ile Trp Lys Leu Gln Lys Ala Ile Ser Val Leu Glu Met
450 455 460

gag aat ggt tac tgg tgg tgc cgg agg aag aaa cat cgt gaa ggg agg 1441
Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg
465 470 475 480

ctg ctg aaa gtc ggc acc agg ggg gat ggt acc cct gat gag cgc tgg 1489
Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp
485 490 495

tgc ttc agg gtg gag gaa gta aat tgg gct gct tgg gag aag act ctt 1537
Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu
500 505 510

ccc acc tta tct gag gat cca tca ggg cca ggc atc act ggt aat aaa 1585
Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly Ile Thr Gly Asn Lys
515 520 525

aag aac cca acc tct aaa ccg ggg aag aac agt gcc tca gag gaa gac 1633
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp
530 535 540

cat ctg ccc ctt cag gtc ctc cag tcc ccc tgatggccca gatgcagcag 1683
His Leu Pro Leu Gln Val Leu Gln Ser Pro
545 550

caggctggca ggatggagta gggaatcttc ccagccacac cagaggctac tgaattttgg 1743
tggaaatata aatatttttt ttgcataaaa aaaaaaaaaa agggcggccg c 1794



<210> 11 
<211> 554 
<212> PRT 
<213> Rattus sp. 

<400> 11 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp
1 5 10 15

Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu
20 25 30

Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met
35 40 45

Ile Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val Ile His Met Tyr
50 55 60

Asp Gly Leu Leu Gln Met Gly Ala Arg Leu Cys Pro Thr Val Gln Leu
65 70 75 80

Glu Glu Ile Ser Asn His Gln Gly Leu Thr Pro Leu Lys Leu Ala Ala
85 90 95

Lys Glu Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe
100 105 110

Ser Gly Pro Tyr Gln Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr
115 120 125

Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp
130 135 140

Glu Lys Asn Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro
145 150 155 160

Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln
165 170 175

Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys
180 185 190

Tyr Leu Val Tyr Met Phe Ile Phe Thr Val Val Ala Tyr His Gln Pro
195 200 205

Ser Leu Asp Gln Pro Ala Ile Pro Ser Ser Lys Ala Thr Phe Gly Glu
210 215 220

Ser Met Leu Leu Leu Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr
225 230 235 240

Leu Leu Leu Gly Gln Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe Ile
245 250 255

Trp Ile Ser Phe Met Asp Ser Tyr Phe Glu Ile Leu Phe Leu Leu Gln
260 265 270

Ala Leu Leu Thr Val Leu Ser Gln Val Leu Arg Phe Met Glu Thr Glu
275 280 285

Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn
420 425 430

Met Leu Ile Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn
435 440 445

Ser Trp Ser Ile Trp Lys Leu Gln Lys Ala Ile Ser Val Leu Glu Met
450 455 460

Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg
465 470 475 480

Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp
485 490 495

Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu
500 505 510

Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly Ile Thr Gly Asn Lys
515 520 525

Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp
530 535 540

His Leu Pro Leu Gln Val Leu Gln Ser Pro
545 550



<210> 12 
<211> 1662 
<212> DNA 
<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1)..(1662)

<400> 12 

tcg acc cac gcg tcc gct ctt tct ctg gct gcg tgc acc aag cag tgg 48
Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp
1 5 10 15

gat gtg gtg acc tac ctc ctg gag aac cca cac cag ccg gcc agc ctg 96
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu
20 25 30

gag gcc acc gac tcc ctg ggc aac aca gtc ctg cat gct ctg gta atg 144
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met
35 40 45

att gca gat aac tcg cct gag aac agt gcc ctg gtg atc cac atg tac 192
Ile Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val Ile His Met Tyr
50 55 60

gac ggg ctt cta caa atg ggg gcg cgc ctc tgc ccc act gtg cag ctt 240
Asp Gly Leu Leu Gln Met Gly Ala Arg Leu Cys Pro Thr Val Gln Leu
65 70 75 80

gag gaa atc tcc aac cac caa ggc ctc aca ccc ctg aaa cta gcc gcc 288
Glu Glu Ile Ser Asn His Gln Gly Leu Thr Pro Leu Lys Leu Ala Ala
85 90 95

aag gaa ggc aaa atc gag att ttc agg cac att ctg cag cgg gaa ttc 336
Lys Glu Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe
100 105 110

tca gga ccg tac cag ccc ctt tcc cga aag ttt act gag tgg tgt tac 384
Ser Gly Pro Tyr Gln Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr
115 120 125

ggt cct gtg cgg gta tcg ctg tac gac ctg tcc tct gtg gac agc tgg 432
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp
130 135 140

gaa aag aac tcg gtg ctg gag atc atc gct ttt cat tgc aag agc ccg 480
Glu Lys Asn Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro
145 150 155 160

aac cgg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 528
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln
165 170 175

gag aaa tgg gat cgg ctc gtc tca aga ttc ttc ttc aac ttc gcc tgc 576
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys
180 185 190

tac ttg gtc tac atg ttc atc ttc acc gtc gtt gcc tac cac cag cct 624
Tyr Leu Val Tyr Met Phe Ile Phe Thr Val Val Ala Tyr His Gln Pro
195 200 205

tcc ctg gat cag cca gcc atc ccc tca tca aaa gcg act ttt ggg gaa 672
Ser Leu Asp Gln Pro Ala Ile Pro Ser Ser Lys Ala Thr Phe Gly Glu
210 215 220

tcc atg ctg ctg ctg ggc cac att ctg atc ctg ctt ggg ggt att tac 720
Ser Met Leu Leu Leu Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr
225 230 235 240

ctc tta ctg ggc cag ctg tgg tac ttt tgg cgg cgg cgc ctg ttt atc 768
Leu Leu Leu Gly Gln Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe Ile
245 250 255

tgg atc tca ttc atg gac agc tac ttt gaa atc ctc ttt ctc ctt cag 816
Trp Ile Ser Phe Met Asp Ser Tyr Phe Glu Ile Leu Phe Leu Leu Gln
260 265 270

gct ctg ctc aca gtg ctg tcc cag gtg ctg cgc ttc atg gag act gaa 864
Ala Leu Leu Thr Val Leu Ser Gln Val Leu Arg Phe Met Glu Thr Glu
275 280 285

tgg tac cta ccc ctg cta gtg tta tcc cta gtg ctg ggc tgg ctg aac 912
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn
290 295 300

ctg ctt tac tac aca cgg ggc ttt cag cac aca ggc atc tac agt gtc 960
Leu Leu Tyr Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val
305 310 315 320

atg atc cag aag gtc atc ctt cga gac ctg ctc cgt ttc ctg ctg gtc 1008
Met Ile Gln Lys Val Ile Leu Arg Asp Leu Leu Arg Phe Leu Leu Val
325 330 335

tac ctg gtc ttc ctt ttc ggc ttt gct gta gcc cta gta agc ttg agc 1056
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser
340 345 350

aga gag gcc cga agt ccc aaa gcc cct gaa gat aac aac tcc aca gtg 1104
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val
355 360 365

acg gaa cag ccc acg gtg ggc cag gag gag gag cca gct cca tat cgg 1152
Thr Glu Gln Pro Thr Val Gly Gln Glu Glu Glu Pro Ala Pro Tyr Arg
370 375 380

agc att ctg gat gcc tcc cta gag ctg ttc aag ttc acc att ggt atg 1200
Ser Ile Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr Ile Gly Met
385 390 395 400

ggg gag ctg gct ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 1248
Gly Glu Leu Ala Phe Gln Glu Gln Leu Arg Phe Arg Gly Val Val Leu
405 410 415

ctg ttg ctg ttg gcc tac gtc ctt ctc acc tac gtc ctg ctg ctc aac 1296
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn
420 425 430

atg ctc att gct ctc atg agc gaa act gtc aac cac gtt gct gac aac 1344
Met Leu Ile Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn
435 440 445

agc tgg agc atc tgg aag ttg cag aaa gcc atc tct gtc ttg gag atg 1392
Ser Trp Ser Ile Trp Lys Leu Gln Lys Ala Ile Ser Val Leu Glu Met
545 550



<210> 13 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 13 

Ala Phe His Cys Lys Ser Pro His Arg His Arg Met Val Val Leu Glu 
1 5 10 15 



<210> 14 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 14 

Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gln Pro
1 5 10 15

Met Glu Gly Gin Glu Asp Glu Gly Asn 
20 25 

<210> 15 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 15 

Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gln
1 5 10 15

Arg Ala Gly 



<210> 16 
<211> 20 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

taggagaccc cgttgccacg 20 



<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 17 

gattcacttg gggacagtga cg 22



<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 18 

ttaagctccc gttctccacc g 21 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

gctgcgggag gaagtgaagc 20 



<210> 20 
<211> 630 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp
1 5 10 15

Gly Gly Gln Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe
20 25 30

Gly Ser Gly Leu Pro Pro Met Glu Ser Gln Phe Gln Gly Glu Asp Arg
35 40 45

Lys Phe Ala Pro Gln Ile Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr
50 55 60

Gly Ala Ser Gln Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe
65 70 75 80

Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu
85 90 95

Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu
100 105 110

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys
115 120 125

Asp Gly Val Asn Ala Cys Ile Leu Pro Leu Leu Gln Ile Asp Arg Asp
130 135 140

Ser Gly Asn Pro Gln Pro Leu Val Asn Ala Gln Cys Thr Asp Asp Tyr
145 150 155 160

Tyr Arg Gly His Ser Ala Leu His Ile Ala Ile Glu Lys Arg Ser Leu
165 170 175

Gln Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg
180 185 190

Ala Cys Gly Arg Phe Phe Gln Lys Gly Gln Gly Thr Cys Phe Tyr Phe
195 200 205

Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gln Trp Asp Val
210 215 220

Val Ser Tyr Leu Leu Glu Asn Pro His Gln Pro Ala Ser Leu Gln Ala
225 230 235 240

Thr Asp Ser Gln Gly Asn Thr Val Leu His Ala Leu Val Met Ile Ser
245 250 255

Asp Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met Tyr Asp Gly
260 265 270

Leu Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln Leu Glu Asp
275 280 285

Ile Arg Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu
290 295 300

Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu Phe Ser Gly
305 310 315 320

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val
325 330 335

Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn
340 345 350

Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro His Arg His
355 360 365

Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln Ala Lys Trp
370 375 380

Asp Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu Ile
385 390 395 400

Tyr Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro Thr Leu Lys
405 410 415



Lys Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu
420 425 430

Leu Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr Leu Leu Val
435 440 445

Gly Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile Trp Ile Ser
450 455 460

Phe Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln Ala Leu Leu
465 470 475 480

Thr Val Val Ser Gln Val Leu Cys Phe Leu Ala Ile Glu Trp Tyr Leu
485 490 495



Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr
500 505 510

Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val Met Ile Gln
515 520 525

Lys Lys Ala Ile Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys
530 535 540

Arg Lys Lys Gln Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro
545 550 555 560

Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn
565 570 575

Trp Ala Ser Trp Glu Gln Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser
580 585 590

Gly Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro
595 600 605

Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val
610 615 620

Gln Leu Leu Gln Ser Asn
625 630